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NOVEL STREPTOCOCCUS ANTIGENS 

This application claims priority from US patent application 60/113,800 filed 
december 23 1998 which is herein incorporated by reference. 

5 FIELD OF THE INVENTION 

The present invention is related to antigens, more particularly 
protein antigens of streptococcus pneumoniaepathogen which are 
useful as vaccine components for therapy and/or prophylaxis. 

10 

BACKGROUND OF THE INVENTION 

S . pneumoniae is an important agent of disease in man especially 
among infants, the elderly and immunocompromised persons. It is 

15 a bacterium frequently isolated from patients with invasive 

diseases such as bacteraemia/septicaemia, pneumonia, meningitis 
with high morbidity and mortality throughout the world. Even 
with appropriate antibiotic therapy, pneumococcal infections 
still result in many deaths. Although the advent of 

20 antimicrobial drugs has reduced the overall mortality from 
pneumococcal disease, the presence of resistant pneumococcal 
organisms has become a major problem in the world today. 
Effective pneumococcal vaccines could have a major impact on the 
morbidity and mortality associated with S_;_ pneumoniae disease. 

25 Such vaccines would also potentially be useful to prevent otitis 
media in infants and young children. 

Efforts to develop a pneumococcal vaccine have generally 
concentrated on generating immune responses to the pneumococcal 
30 capsular polysaccharide. More than 80 pneumococcal capsular 
serotypes have been identified on the basis of antigenic 
differences. The currently available pneumococcal vaccine, 
comprising 23 capsular polysaccharides that most frequently 

1 



caused disease, has significant shortcomings related primarily 
to the poor immunogenicity of some capsular polysaccharides, the 
diversity of the serotypes and the differences in the 
distribution of serotypes over time, geographic areas and age 
5 groups. In particular, the failure of existing vaccines and 

capsular conjugate vaccines currently in development to protect 
young children against all serotypes spurres evaluation of other 
S . pneumoniae components. Although immunogenicity of capsular 
polysaccharides can be improved, serotype specificity will still 
10 represent a major limitation of polysaccharide-based vaccines. 
The use of a antigenically conserved immunogenic pneumococcal 
protein antigen, either by itself or in combination with 
additional components, offers the possibility of a protein-based 
pneumococcal vaccine . 

15 

PCT Publication number WO98/1893 0 published may 7 1998 entitled 
"Streptococcus Pneumoniae antigens and vaccines" describes 
certain polypeptides which are claimed to be antigenic. However, 
no biological activity of these polypeptides is reported. 

20 

Therefore their remains an unmet need for Streptococcus 
antigens that may be used as vaccine components for the 
prophylaxis and/or therapy of Streptococcus infection. 

25 

SUMMARY OF THE INVENTION 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
30 70% identity to a second polypeptide comprising a sequence 

chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 77 to 
79, 81, 83 or fragments, analogs or derivatives thereof. 
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In other aspects, there are provided vectors comprising 
polynucleotides of the invention operably linked to an 
expression control region, as well as host cells transfected 
with said vectors and methods of producing polypeptides 
5 comprising culturing said host cells under conditions suitable 
for expression. 

In yet another aspect, there are provided novel polypeptides 
encoded by polynucleotides of the invention. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is the DNA sequence of BVH-3 gene; SEQ ID NO: 1. 

15 

Figure 2 is the amino acid sequence of BVH-3 protein; SEQ ID NO: 
2. 

Figure 3 is the DNA sequence of BVH-11 gene; SEQ ID NO: 3. 

20 

Figure 4 is the amino acid sequence of BVH-11 protein; SEQ ID 
NO: 4. 

Figure 5 is the DNA sequence of BVH-28 gene; SEQ ID NO: 5. 

25 

Figure 6 is the amino acid sequence of BVH-2 8 protein; SEQ ID 
NO: 6. 

Figure 7 is the DNA sequence of BVH-3A gene which corresponds to 
30 the 5 ' terminal end of BVH-3; SEQ ID NO: 1. 
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Figure 8 is the amino acid sequence of BVH-3A protein; SEQ ID 
NO: 8. 

Figure 9 is the DNA sequence of BVH-3B gene which corresponds to 
5 the 3' terminal end of BVH-3; SEQ ID NO: 9. 

Figure 10 is the amino acid sequence of BVH-3B protein; SEQ ID 
NO: 10. 

10 

Figure 11 depicts the comparison of the predicted amino acid 
sequences of the BVH-3 open reading frames from WU2 , RX1, 
JNR.7/87, SP64, P4241 and A66 pneumoniae strains by using 
the program Clustal W from MacVector sequence analysis software 
15 (version 6-5) . Underneath the alignment, there is a consensus 
line where * and . characters indicate identical and similar 
amino acid residues, respectively. 

Figure 12 depicts the comparison of the predicted amino acid 
20 sequences of the BVH-11 open reading frames from WU2 , Rxl, 

JNR.7/87, SP64, P4241, A66 and SP63 pneumoniae strains by 
using the program Clustal W from MacVector sequence analysis 
software (version 6.5). Underneath the alignment, there is a 
consensus line where * and . characters indicate identical and 
25 similar amino acid residues, respectively. 

Figure 13 depicts the comparison of the predicted amino acid 
sequences of the BVH-11 proteins from various pneumoniae 
strains. The degrees of identity (I) and similarity (S) were 
30 determined by using the program Clustal W from MacVector 
sequence analysis software (version 6.5). 
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Figure 14 is a DNA sequence containing the complete BVH-3 gene 
(open reading frame " ORF " at nucleotides 1777 to 4896); SEQ ID 
NO: 11. 



5 Figure 15 is a DNA sequence containing the complete BVH-11 gene 
(ORF at nucleotides 45 to 2567); SEQ ID NO: 12. 



Figure 16 is a DNA sequence containing the complete BVH-11-2 
gene (ORF at nucleotides 114 to 2630); SEQ ID NO: 13. 

Figure 17 is the amino acid sequence of BVH-11-2 protein; SEQ 
ID NO: 14. 



Figure 18 is the DNA sequence of SP63 BVH-3 gene; SEQ ID NO: 15. 

15 

Figure 19 is the amino acid sequence of SP63 BVH-3 protein; SEQ 
ID NO: 16. 



Figure 20 is the amino acid sequence of BVH-3M protein; SEQ ID 
20 NO: 55. 



Figure 21 is the amino acid sequence of BVH-3AD protein; SEQ ID 
NO: 56. 



25 Figure 22 is the amino acid sequence of L- BVH-3 -AD protein; SEQ 
ID NO: 57. 



Figure 2 3 is the amino acid sequence of NEW12 protein; SEQ ID 
NO: 58. 

Figure 24 is the amino acid sequence of BVH-3C protein; SEQ ID 
NO: 59. 
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Figure 25 is the amino acid sequence of BVH-11M protein; SEQ ID 
NO: 60 . 

5 Figure 26 is the amino acid sequence of BVH-11A protein; SEQ ID 
NO: 61. 

Figure 27 is the amino acid sequence of BVH-11B (also called 
Newl3) protein; SEQ ID NO: 62. 

10 

Figure 28 is the amino acid sequence of BVH-11C protein; SEQ ID 
NO: 63. 

Figure 2 9 is the amino acid sequence of NEW1 protein; SEQ ID 
15 NO: 64. 

Figure 3 0 is the amino acid sequence of NEW2 protein; SEQ ID 
NO: 65. 

20 Figure 31 is the amino acid sequence of NEW3 protein; SEQ ID 
NO: 66. 

Figure 32 is the amino acid sequence of NEW4 protein; SEQ ID 
NO: 67. 

25 

Figure 33 is the amino acid sequence of NEW 5 protein; SEQ ID 
NO: 68. 

Figure 34 is the amino acid sequence of NEW 6 protein; SEQ ID 
30 NO: 69. 



Figure 35 is the amino acid sequence of NEW 7 protein; SEQ ID 
NO: 70. 

Figure 36 is the amino acid sequence of NEW8 protein; SEQ ID 
5 NO: 71. 

Figure 3 7 is the amino acid sequence of NEW 9 protein; SEQ ID 
NO: 72. 

10 Figure 38 is the amino acid sequence of BVH-11-2M protein; SEQ 
ID NO: 73. 

Figure 3 9 is the amino acid sequence of NEW10 protein; SEQ ID 
NO: 74. 

15 

Figure 4 0 is the amino acid sequence of NEW11 protein; SEQ ID 
NO: 75. 

Figure 41 is the DNA sequence of NEW 12 gene; SEQ ID NO: 76. 

20 

Figure 42 is the amino acid sequence of NEW14 protein; SEQ ID 
NO: 77. 

Figure 43 is the amino acid sequence of NEW 15 protein; SEQ ID 
25 NO: 78. 

Figure 44 is the amino acid sequence of NEW16 protein; SEQ ID 
NO: 79. 

30 Figure 4 5 is the DNA sequence of GBS BVH-71 gene; SEQ ID NO: 
80. 
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Figure 46 is the amino acid sequence of GBS BVH-71 protein; 
SEQ ID NO: 81. 

Figure 47 is the DNA sequence of GAS BVH-71 gene; SEQ ID NO: 82. 

5 

Figure 48 is the amino acid sequence of GAS BVH-71 protein; SEQ 
ID NO: 83. 

10 DETAILED DESCRIPTION OF THE INVENTION 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
15 chosen from SEQ ID NOs: 2, 4 # 6, 8, 10 , 14, 16, 55 to 75, 77 to 
79, 81, 83 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
20 95% identity to a second polypeptide comprising a sequence 

chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 77 to 
79, 81, 83 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
25 isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 75, 77 to 
79, 81, 83 or fragments, analogs or derivatives thereof. 

30 According to one aspect, the present invention provides an 

isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
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chosen from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 75, 77 to 79, 

81, 83 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
5 isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 75, 77 to 79 
or fragments, analogs or derivatives thereof. 

10 According to one aspect, the present invention provides an 

isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
chosen from SEQ ID NOs: 2, 8, 10, 16, 55, 56, 57, 58, 59, 64, 
65, 66, 78 or fragments, analogs or derivatives thereof. 

15 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
chosen from SEQ ID NOs: 2, 8, 10, 16, 55, 56, 57, 59, 64, 65, 
20 66, 78 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
25 chosen from SEQ ID NOs: 4, 14, 58, 60, 61, 62, 63, 67, 68, 69, 
70, 71, 72, 73, 74, 75, 77, 79 or fragments, analogs or 
derivatives thereof . 

According to one aspect, the present invention provides an 
30 isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
chosen from SEQ ID NOs: 4, 14, 60, 61, 62, 63, 67, 68, 69, 70, 
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X r 

71, 72, 73, 74, 75, 77, 79 or fragments, analogs or derivatives 
thereof . 

5 According to one aspect, the present invention provides an 

isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
chosen from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 75, 77 to 79 or 
fragments, analogs or derivatives thereof. 

10 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence chosen 
from SEQ ID NOs: 10, 55 to 75, 77, 78, 79 or fragments, analogs 
15 or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence chosen 
20 from SEQ ID NOs: 55 to 75, 77, 78, 79 or fragments, analogs or 
derivatives thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
25 70% identity to a second polypeptide comprising a sequence 

chosen from SEQ ID NOs: 2, 4, 6, 8, 10 or fragments, analogs or 
derivatives thereof . 

30 According to one aspect, the present invention provides an 

isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
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chosen from SEQ ID NOs: 2, 4, 10, 14 , 16 or fragments, analogs 
or derivatives thereof. 

According to one aspect, the present invention provides an 
5 isolated polynucleotide encoding a polypeptide having at least 
7 0% identity to a second polypeptide comprising a sequence 
chosen from SEQ ID NOs: 2, 4, 14, 16 or fragments, analogs or 
derivatives thereof. 

10 According to one aspect, the present invention provides an 

isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 2 or fragments, analogs or derivatives thereof. 

15 According to one aspect, the present invention provides an 

isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 4 or fragments, analogs or derivatives thereof. 

20 According to one aspect, the present invention provides an 

isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 10 or fragments, analogs or derivatives thereof. 

25 According to one aspect, the present invention provides an 

isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 14 or fragments, analogs or derivatives thereof. 

30 According to one aspect, the present invention provides an 

isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence SEQ ID 



11 



NO: 16 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
5 70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 58 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
10 70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 60 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
15 70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 62 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
20 70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 64 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
25 70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 67 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
30 70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 68 or fragments, analogs or derivatives thereof. 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 69 or fragments, analogs or derivatives thereof. 

5 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 72 or fragments, analogs or derivatives thereof. 

10 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 74 or fragments, analogs or derivatives thereof. 

15 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising sequence SEQ ID 
NO: 77 or fragments, analogs or derivatives thereof. 

20 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NOs: 2, 4, 6, 8, 10 or fragments, analogs or 
derivatives thereof . 

25 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 77 to 79, 
81, 83 or fragments, analogs or derivatives thereof. 

30 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence chosen 
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from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 75, 77 to 79, 81, 

83 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
5 polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NOs: 2, 4 , 10, 14, 16, 55 to 75, 77 to 79, 81, 83 
or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
10 polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 75, 77 to 79 or 
fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
15 polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 75, 77 to 79 or 
fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
20 polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NOs: 2, 4, 10, 14, 16 or fragments, analogs or 
derivatives thereof . 

According to one aspect, the present invention relates to 
25 polypeptides characterized by the amino acid sequence 

comprising sequence SEQ ID NO: 2 or fragments, analogs or 
derivatives thereof . 

According to one aspect, the present invention relates to 
30 polypeptides characterized by the amino acid sequence 

comprising sequence SEQ ID NO: 4 or fragments, analogs or 
derivatives thereof . 
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According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 10 or fragments, analogs or 
5 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 14 or fragments, analogs or 
10 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 16 or fragments, analogs or 
15 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NOs: 10, 55 to 75, 77, 78, 79 or fragments, analogs 
20 or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 or 
25 fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 or 
30 fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 



15 



polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 or 

fragments, analogs or derivatives thereof. 

5 According to one aspect, the present invention relates to 

polypeptides characterized by the amino acid sequence chosen 
from SEQ ID NO: 10, 62, 64, 67, 68, 74, 77 or fragments, 
analogs or derivatives thereof. 

10 According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 58 or fragments, analogs or 
derivatives thereof . 

15 According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 62 or fragments, analogs or 
derivatives thereof . 

20 According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 64 or fragments, analogs or 
derivatives thereof . 

25 According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 67 or fragments, analogs or 
derivatives thereof . 

30 According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 68 or fragments, analogs or 
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derivatives thereof. 



According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
5 comprising sequence SEQ ID NO: 74 or fragments, analogs or 
derivatives thereof . 



According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
10 comprising sequence SEQ ID NO: 77 or fragments, analogs or 
derivatives thereof . 



In a further embodiment, the present invention also relates to 
chimeric polypeptides which comprise one or more polypeptides 
15 or fragments, analogs or derivatives thereof as described in 
the present application. 

In a further embodiment, the present invention also relates to 
chimeric polypeptides which comprise one or more polypeptides 
20 or fragments, analogs or derivatives thereof as defined in the 
figures of the present application. 



In a further embodiment, the present application also relates 
to chimeric polypeptides which comprise two or more 
25 polypeptides chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof /provided that the polypeptides or fragments, analogs 
or derivatives thereof are linked as to form a chimeric 
polypeptide . 

30 

In a further embodiment, the chimeric polypeptide will comprise 
two or more polypeptides chosen from SEQ ID NOs :10, 58, 60, 
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62, 64, 67, 68, 69, 72, 74, 77 or fragments, analogs or 
derivatives thereof; provided that the polypeptides or 
fragments , analogs or derivatives thereof are linked as to form 
a chimeric polypeptide. 

5 

In a further embodiment, the chimeric polypeptide will comprise 
two or more polypeptides chosen from SEQ ID NOs :10, 58, 60, 
62, 64, 67, 68, 74, 77 or fragments, analogs or derivatives 
thereof; provided that the polypeptides or fragments, analogs 
10 or derivatives thereof are linked as to form a chimeric 
polypeptide . 

In a further embodiment, the chimeric polypeptide will comprise 
two or more polypeptides chosen from SEQ ID NOs :10, 62, 64, 
15 67, 68, 74, 77 or fragments, analogs or derivatives thereof; 
provided that the polypeptides or fragments, analogs or 
derivatives thereof are linked as to form a chimeric 
polypeptide . 

20 In a further embodiment, the chimeric polypeptide will comprise 
between 2 and 5 polypeptides. 

In a further embodiment, the chimeric polypeptide will comprise 
between 2 and 4 polypeptides. 

25 

In a further embodiment, the chimeric polypeptide will comprise 
between 2 and 3 polypeptides. 

In a further embodiment, the chimeric polypeptide will comprise 
30 2 polypeptides. 
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In a further embodiment, there is provided a chimeric 
polypeptide of formula (I) : 
A- (B) m - (C) n -D (I) 



5 Wherein; 

m is 0 or 1, 
n is 0 or 1, 

A is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 
77 to 79, 81, 83 or fragments, analogs or derivatives thereof; 
10 B is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 
77 to 79, 81, 83 or fragments, analogs or derivatives thereof; 
C is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 
77 to 79, 81, 83 or fragments, analogs or derivatives thereof; 
and 

15 D is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 

77 to 79, 81, 83 or fragments, analogs or derivatives thereof. 

In a further embodiment, 

A is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 69, 

20 72, 74, 77 or fragments, analogs or derivatives thereof; 

B is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 69, 

72, 74, 77, or fragments, analogs or derivatives thereof; 

C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 69, 

72, 74, 77 or fragments, analogs or derivatives thereof; and 

25 D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 69, 

72, 74, 77 or fragments, analogs or derivatives thereof. 

In a further embodiment, 

A is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 74, 77 

30 or fragments, analogs or derivatives thereof; 

B is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 74, 

77, or fragments, analogs or derivatives thereof; 
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C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 74, 77 

or fragments, analogs or derivatives thereof; and 

D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 74, 77 

or fragments, analogs or derivatives thereof. 

5 

In one embodiment, chimeric polypeptides of the present 
invention comprise those wherein the following embodiments are 
present, either independently or in combination. 

10 In a further embodiment, A is SEQ ID NOs :10, 58, 62, 64, 67, 

68, 74, 77 or fragments, analogs or derivatives thereof. 
In a further embodiment, A is SEQ ID NO :10 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :58 or fragments, 
15 analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :62 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 
20 In a further embodiment, A is SEQ ID NO :67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :68 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :74 or fragments, 
25 analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :77 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NOs :10, 58, 62, 64, 67, 

30 68, 74, 77 or fragments, analogs or derivatives thereof. 
In a further embodiment, B is SEQ ID NO :10 or fragments, 
analogs or derivatives thereof. 

20 



In a further embodiment, B is SEQ ID NO : 58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 
5 In a further embodiment, B is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO : 68 or fragments, 
10 analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO : 77 or fragments, 
analogs or derivatives thereof. 

15 

In a further embodiment, C is SEQ ID NOs :10, 58, 62, 64, 67, 

68, 74, 77 or fragments, analogs or derivatives thereof. 
In a further embodiment, C is SEQ ID NO :10 or fragments, 
analogs or derivatives thereof. 
20 In a further embodiment, C is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 62 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO :64 or fragments, 
25 analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 68 or fragments, 
analogs or derivatives thereof. 
30 In a further embodiment, C is SEQ ID NO : 74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 77 or fragments, 
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analogs or derivatives thereof. 



In a further embodiment, D is SEQ ID NO :10, 58, 62, 64, 67, 

68, 74, 77 or fragments, analogs or derivatives thereof. 
5 In a further embodiment, D is SEQ ID NO :10 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :62 or fragments, 
10 analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :67 or fragments, 
analogs or derivatives thereof. 
15 In a further embodiment, D is SEQ ID NO :68 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :77 or fragments, 
20 analogs or derivatives thereof. 



In a further embodiment, m is 0. 
In a further embodiment, n is 0. 

25 

In a further embodiment, m and n are 0. 

In a further embodiment, m and n are 0 , A is SEQ ID NO: 64 or 
fragments, analogs or derivatives thereof, B is SEQ ID NO: 62 or 
30 fragments, analogs or derivatives thereof. 

In a further embodiment, m and n are 0, A is SEQ ID NO: 62 or 
fragments, analogs or derivatives thereof, B is SEQ ID NO: 64 or 
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fragments, analogs or derivatives thereof. 

In accordance with the present invention, all nucleotides 
encoding polypeptides and chimeric polypeptides are within the 
5 scope of the present invention. 

In a further embodiment, the polypeptides or chimeric 
polypeptides in accordance with the present invention are 
antigenic . 

10 

In a further embodiment, the polypeptides or chimeric 
polypeptides in accordance with the present invention can 
elicit an immune response in an individual. 

15 In a further embodiment, the present invention also relates to 
polypeptides which are able to raise antibodies having binding 
specificity to the polypeptides or chimeric polypeptides of the 
present invention as defined above. 

20 An antibody that » has binding specificity" is an antibody that 
recognizes and binds the selected polypeptide but which does 
not substantially recognize and bind other molecules in a 
sample, e.g., a biological sample, which naturally includes the 
selected peptide. Specific binding can be measured using an 

25 ELISA assay in which the selected polypeptide is used as an 
antigen. 

Unless otherwise defined, all technical and scientific terms 
used herein have the same meaning as commonly understood by one 
30 of ordinary skill in the art to which this invention belongs. 
All publications, patent applications, patents, and other 
references mentioned herein are incorporated by reference in 
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their entirety. In case of conflict, the present specification, 
including definitions, will control. In addition, the 
materials, methods, and examples are illustrative only and not 
intended to be limiting. 

5 

As used herein, "fragments", "derivatives" or "analogs" of the 
polypeptides of the invention include those polypeptides in 
which one or more of the amino acid residues are substituted 
with a conserved or non- conserved amino acid residue 

10 (preferably conserved) and which may be natural or unnatural. 
In one embodiment, derivatives and analogs of polypeptides of 
the invention will have about 70% identity with those sequences 
illustrated in the figures or fragments thereof. That is, 70% 
of the residues are the same. In a further embodiment, 

15 polypeptides will have greater than 75% homology. In a further 
embodiment, polypeptides will have greater than 80% homology. 
In a further embodiment, polypeptides will have greater than 
85% homology. In a further embodiment, polypeptides will have 
greater than 90% homology. In a further embodiment, 

20 polypeptides will have greater than 95% homology. In a further 
embodiment, polypeptides will have greater than 99% homology. 
In a further embodiment, derivatives and analogs of 
polypeptides of the invention will have fewer than about 2 0 
amino acid residue substitutions, modifications or deletions 

25 and more preferably less than 10. Preferred substitutions are 
those known in the art as conserved i.e. the substituted 
residues share physical or chemical properties such as 
hydrophobicity, size, charge or functional groups. 

30 In accordance with the present invention, polypeptides of the 
invention include both polypeptides and chimeric polypeptides. 
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Also included are polypeptides which have fused thereto other 
compounds which alter the polypeptides biological or 
pharmacological properties i.e. polyethylene glycol (PEG) to 
increase half -life; leader or secretory amino acid sequences 
5 for ease of purification; prepro- and pro- sequences; and 
(poly) saccharides . 

Furthermore, in those situations where amino acid regions are 
found to be polymorphic, it may be desirable to vary one or 
10 more particular amino acids to more effectively mimic the 
different epitopes of the different streptococcus strains. 

Moreover, the polypeptides of the present invention can be 
modified by terminal -NH 2 acylation (eg. by acetylation, or 
15 thioglycolic acid amidation, terminal carbosy amidation, e.g. 
with ammonia or methylamine) to provide stability, increased 
hydrophobic! ty for linking or binding to a support or other 
molecule . 

20 Also contemplated are hetero and homo polypeptide multimers of 
the polypeptide fragments, analogues and derivatives. These 
polymeric forms include, for example, one or more polypeptides 
that have been cross-linked with cross-linkers such as 
avidin/biotin, gluteraldehyde or dimethyl super imidat e . Such 

25 polymeric forms also include polypeptides containing two or 
more tandem or inverted contiguous sequences, produced from 
multicistronic mRNAs generated by recombinant DNA technology. 
Preferably, a fragment, analog or derivative of a polypeptide 
of the invention will comprise at least one antigenic region 

30 i.e. at least one epitope. 

In order to achieve the formation of antigenic polymers (i.e. 
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synthetic multimers) , polypeptides may be utilized having 
bishaloacetyl groups, nitroarylhalides, or the like, where the 
reagents being specific for thio groups. Therefore, the link 
between two mercapto groups of the different peptides may be a 
5 single bond or may be composed of a linking group of at least 
two, typically at least four, and not more than 16, but usually 
not more than about 14 carbon atoms. 

In a particular embodiment, polypeptide fragments, analogs and 
10 derivatives of the invention do not contain a methionine (Met) 
starting residue. Preferably, polypeptides will not 
incorporate a leader or secretory sequence (signal sequence) . 
The signal portion of a polypeptide of the invention may be 
determined according to established molecular biological 
15 techniques. In general, the polypeptide of interest may be 
isolated from a streptococcus culture and subsequently 
sequenced to determine the initial residue of the mature 
protein and therefore the sequence of the mature polypeptide. 

20 According to another aspect, there are provided vaccine 

compositions comprising one or more streptococcus polypeptides 
of the invention in admixture with a pharmaceutically 
acceptable carrier diluent or adjuvant. Suitable adjuvants 
include oils i.e. Freund's complete or incomplete adjuvant; 

25 salts i.e. A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , AlNH 4 (S0 4 ) 2 , silica, kaolin, 
carbon polynucleotides i.e. poly IC and poly AU. Preferred 
adjuvants include QuilA and Alhydrogel . Vaccines of the 
invention may be administered parenterally by injection, rapid 
infusion, nasopharyngeal absorption, dermoabsorption, or bucal 

30 or oral. Pharmaceutically acceptable carriers also include 
tetanus toxoid . 
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Vaccine compositions of the invention are used for the 
treatment or prophylaxis of streptococcus infection and/or 
diseases and symptoms mediated by streptococcus infection as 
described in P.R. Murray (Ed, in chief), E.J. Baron, M.A. 
5 Pfaller, F.C. Tenover and R.H. Yolken. Manual of Clinical 

Microbiology, ASM Press, Washington, D.C. sixth edition, 1995, 
1482p which are herein incorporated by reference. In one 
embodiment, vaccine compositions of the present invention are 
used for the treatment or prophylaxis of meningitis, otitis 

10 media, bacteremia or pneumonia. In one embodiment, vaccine 
compositions of the invention are used for the treatment or 
prophylaxis of streptococcus infection and/or diseases and 
symptoms mediated by streptococcus infection, in particular 
S. pneumoniae , group A streptococcus {pyogenes) , group B 

15 streptococcus (GBS or agalactiae) , dysgalactiae , uteris, 
nocardia as well as Staphylococcus aureus. In a further 
embodiment, the streptococcus infection is S .pneumoniae . 

In a particular embodiment, vaccines are administered to those 
20 individuals at risk of streptococcus infection such as infants, 
elderly and immunocompromised individuals. 

As used in the present application, the term » individuals" 
include mammals. In a further embodiment, the mammal is human. 

25 

Vaccine compositions are preferably in unit dosage form of 
about 0.001 to 100 |ug/kg (antigen/body weight) and more 
preferably 0.01 to 10 |ig/kg and most preferably 0 . 1 to 1 (ag/kg 
1 to 3 times with an interval of about 1 to 6 week intervals 
30 between immunizations. 
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According to another aspect, there are provided polynucleotides 
encoding polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs : 2, 4, 6, 8, 10, 14, 16, 55 to 75, 77 to 
79, 81, 83 or fragments, analogs or derivatives thereof. 

5 

In one embodiment, polynucleotides are those illustrated in SEQ 
ID Nos: 1, 3, 5, 7, 9, 11, 12, 13, 15, 76, 80, 82 which may 
include the open reading frames (ORF) , encoding polypeptides of 
the invention. It will be appreciated that the polynucleotide 

10 sequences illustrated in the figures may be altered with 
degenerate codons yet still encode the polypeptides of the 
invention. Accordingly the present invention further provides 
polynucleotides which hybridize to the polynucleotide sequences 
herein above described (or the complement sequences thereof) 

15 having 50% identity between sequences. In one embodiment, at 
least 70% identity between sequences. In one embodiment, at 
least 75% identity between sequences. In one embodiment, at 
least 80% identity between sequences. In one embodiment, at 
least 85% identity between sequences. In one embodiment, at 

20 least 90% identity between sequences. In a further embodiment, 
polynucleotides are hybridizable under stringent conditions 
i.e. having at least 95% identity. In a further embodiment, 
more than 97% identity. 

25 In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NOs : 1, 3, 7, 9, 11, 12, 13, 15, 76, 80, 82 encoding 
polypeptides of the invention. 

In a further embodiment, polynucleotides are those illustrated 
30 in SEQ ID NOs : 1, 3, 9, 11, 12, 13, 15, 76, 80, 82 which may 
include the open reading frames (ORF) , encoding polypeptides of 
the invention. 
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In a further embodiment, polynucleotides are those illustrated 

in SEQ ID NOs : 1, 3, 9, 11, 12, 13, 15, 76 which may include 

the open reading frames (ORF) , encoding polypeptides of the 
5 invention. 

In a further embodiment, polynucleotides are those illustrated 

in SEQ ID NOs : 1, 3, 7, 9, 11, 12, 13, 15, 76 which may 
include the open reading frames (ORF) , encoding polypeptides of 
10 the invention. 

In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NOs : 1, 7, 9, 11, 15, 76 which may include the open 

reading frames (ORF) , encoding polypeptides of the invention. 

15 

In a further embodiment, polynucleotides are those illustrated 

in SEQ ID NOs : 1, 9, 11, 15, 76 which may include the open 

reading frames (ORF) , encoding polypeptides of the invention. 

20 In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NOs : 1, 7, 9, 11 which may include the open reading 
frames (ORF) , encoding polypeptides of the invention. 

In a further embodiment, polynucleotides are those illustrated 
25 in SEQ ID NO : 1, encoding polypeptides of the invention. 

In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NO :7, encoding polypeptides of the invention. 

30 In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NO :9, encoding polypeptides of the invention. 
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In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NO encoding polypeptides of the invention. 

In a further embodiment, polynucleotides are those illustrated 
5 in SEQ ID NO :15, encoding polypeptides of the invention. 

In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NOs : 3, 12, 13, 76, encoding polypeptides of the 
invention. 

10 

In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NO :3, encoding polypeptides of the invention. 

In a further embodiment, polynucleotides are those illustrated 
15 in SEQ ID NO :12, encoding polypeptides of the invention. 

In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NO :13, encoding polypeptides of the invention. 

20 In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NO :76, encoding polypeptides of the invention. 

As will be readily appreciated by one skilled in the art, 
polynucleotides include both DNA and RNA. 

25 

The present invention also includes polynucleotides 
complementary to the polynucleotides described in the present 
application. 

30 In a further aspect, polynucleotides encoding polypeptides of 
the invention, or fragments, analogs or derivatives thereof, 
may be used in a DNA immunization method. That is, they can be 
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incorporated into a vector which is replicable and expressible 
upon injection thereby producing the antigenic polypeptide in 
vivo. For example polynucleotides may be incorporated into a 
plasmid vector under the control of the CMV promoter which is 
5 functional in eukaryotic cells. Preferably the vector is 
injected intramuscularly. 

According to another aspect, there is provided a process for 
producing polypeptides of the invention by recombinant 

10 techniques by expressing a polynucleotide encoding said 
polypeptide in a host cell and recovering the expressed 
polypeptide product. Alternatively, the polypeptides can be 
produced according to established synthetic chemical techniques 
i.e. solution phase or solid phase synthesis of oligopeptides 

15 which are ligated to produce the full polypeptide (block 
ligation) . 

General methods for obtention and evaluation of 
polynucleotides and polypeptides are described in the following 

20 references: Sambrook et al, Molecular Cloning: A Laboratory 
Manual, 2nd ed, Cold Spring Harbor, N.Y. , 1989; Current 
Protocols in Molecular Biology, Edited by Ausubel F.M. et al . , 
John Wiley and Sons, Inc. New York; PCR Cloning Protocols, from 
Molecular Cloning to Genetic Engineering, Edited by White B.A., 

25 Humana Press, Totowa, New Jersey, 1997, 490 pages; Protein 

Purification, Principles and Practices, Scopes R.K., Springer- 
Verlag, New York, 3rd Edition, 1993, 380 pages; Current 
Protocols in Immunology, Edited by Coligan J.E. et al . , John 
Wiley & Sons Inc., New York which are herein incorporated by 

30 reference. 

For recombinant production, host cells are transfected with 
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vectors which encode the polypeptide, and then cultured in a 
nutrient media modified as appropriate for activating 
promoters, selecting transf ormants or amplifying the genes. 
Suitable vectors are those that are viable and replicable in 
5 the chosen host and include chromosomal, non- chromosomal and 
synthetic DNA sequences e.g. bacterial plasmids, phage DNA, 
baculovirus, yeast plasmids, vectors derived from combinations 
of plasmids and phage DNA. The polypeptide sequence may be 
incorporated in the vector at the appropriate site using 

10 restriction enzymes such that it is operably linked to an 
expression control region comprising a promoter, ribosome 
binding site (consensus region or Shine-Dalgarno sequence) , and 
optionally an operator (control element) . One can select 
individual components of the expression control region that are 

15 appropriate for a given host and vector according to 

established molecular biology principles (Sambrook et al, 
Molecular Cloning: A Laboratory Manual, 2nd ed, Cold Spring 
Harbor, N.Y., 1989; Current Protocols in Molecular Biology, 
Edited by Ausubel F.M. et al . , John Wiley and Sons, Inc. New 

20 York incorporated herein by reference) . Suitable promoters 
include but are not limited to LTR or SV4 0 promoter, E.coli 
lac, tac or trp promoters and the phage lambda P L promoter. 
Vectors will preferably incorporate an origin of replication as 
well as selection markers i.e. ampicilin resistance gene. 

25 Suitable bacterial vectors include pET, pQE70, pQE60, pQE-9, 
pbs, pDIO phagescript, psiX174, pbluescript SK, pbsks, pNH8A, 
pNH16a, pNH18A, pNH46A, ptrc99a, pKK223-3, pKK233-3, pDR540, 
pRIT5 and eukaryotic vectors pBlueBacIII, pWLNEO, pSV2CAT, 
pOG44, pXTl, pSG, pSVK3, pBPV, pMSG and pSVL. Host cells may 

30 be bacterial i.e. E.coli , Bacillus subtilis , Streptomvces ; 

fungal i.e. Aspergillus niger , Aspergillus nidulins ; yeast i.e. 
Saccharomvces or eukaryotic i.e. CHO, COS. 
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Upon expression of the polypeptide in culture, cells are 
typically harvested by centrif ugation then disrupted by 
physical or chemical means (if the expressed polypeptide is not 

5 secreted into the media) and the resulting crude extract 

retained to isolate the polypeptide of interest. Purification 
of the polypeptide from culture media or lysate may be achieved 
by established techniques depending on the properties of the 
polypeptide i.e. using ammonium sulfate or ethanol 

10 precipitation , acid extraction, anion or cation exchange 

chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, hydroxy 1 apatite chromatography and 
lectin chromatography. Final purification may be achieved 
using HPLC. 

15 

The polypeptide may be expressed with or without a leader or 
secretion sequence. In the former case the leader may be 
removed using post-translational processing (see US 4,431,739; 
US 4,425,437; and US 4,338,397 incorporated herein by 
20 reference) or be chemically removed subsequent to purifying the 
expressed polypeptide. 

According to a further aspect, the streptococcus polypeptides 
of the invention may be used in a diagnostic test for 
25 streptococcus infection, in particular S. pneumoniae infection. 
Several diagnostic methods are possible, for example detecting 
streptococcus organism in a biological sample, the following 
procedure may be followed: 

a) obtaining a biological sample from a patient ; 
30 b) incubating an antibody or fragment thereof reactive with a 
streptococcus polypeptide of the invention with the 
biological sample to form a mixture; and 
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c) detecting specifically bound antibody or bound fragment in 
the mixture which indicates the presence of streptococcus. 

Alternatively, a method for the detection of antibody specific 
5 to a streptococcus antigen in a biological sample containing or 
suspected of containing said antibody may be performed as 
follows : 

a) obtaining a biological sample from a patient; 

b) incubating one or more streptococcus polypeptides of the 
10 invention or fragments thereof with the biological sample 

to form a mixture; and 

c) detecting specifically bound antigen or bound fragment in 
the mixture which indicates the presence of antibody 
specific to streptococcus. 

15 

One of skill in the art will recognize that this diagnostic 
test may take several forms, including an immunological test 
such as an enzyme -linked immunosorbent assay (ELISA) , a 
radioimmunoassay or a latex agglutination assay, essentially to 
20 determine whether antibodies specific for the protein are 
present in an organism. 

The DNA sequences encoding polypeptides of the invention may 
also be used to design DNA probes for use in detecting the 
25 presence of streptococcus in a biological sample suspected of 
containing such bacteria. The detection method of this 
invention comprises: 

a) obtaining the biological sample from a patient; 

b) incubating one or more DNA probes having a DNA sequence 
30 encoding a polypeptide of the invention or fragments 

thereof with the biological sample to form a mixture; and 

c) detecting specifically bound DNA probe in the mixture 
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which indicates the presence of streptococcus bacteria. 



The DNA probes of this invention may also be used for detecting 
circulating streptococcus i.e. S .pneumoniaenucleic acids in a 
sample, for example using a polymerase chain reaction, as a 
method of diagnosing streptococcus infections. The probe may 
be synthesized using conventional techniques and may be 
immobilized on a solid phase, or may be labelled with a 
detectable label. A preferred DNA probe for this application 
is an oligomer having a sequence complementary to at least 
about 6 contiguous nucleotides of the streptococcus pneumoniae 
polypeptides of the invention. 

Another diagnostic method for the detection of streptococcus in 
a patient comprises : 

a) labelling an antibody reactive with a polypeptide of the 
invention or fragment thereof with a detectable label; 

b) administering the labelled antibody or labelled fragment 
to the patient; and 

c) detecting specifically bound labelled antibody or labelled 
fragment in the patient which indicates the presence of 
streptococcus . 

A further aspect of the invention is the use of the 
streptococcus polypeptides of the invention as immunogens for 
the production of specific antibodies for the diagnosis and in 
particular the treatment of streptococcus infection. Suitable 
antibodies may be determined using appropriate screening 
methods, for example by measuring the ability of a particular 
antibody to passively protect against streptococcus infection 
in a test model. One example of an animal model is the mouse 
model described in the examples herein. The antibody may be a 
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whole antibody or an antigen-binding fragment thereof and may 
belong to any immunoglobulin class. The antibody or fragment 
may be of animal origin, specifically of mammalian origin and 
more specifically of murine, rat or human origin. It may be a 

5 natural antibody or a fragment thereof, or if desired, a 
recombinant antibody or antibody fragment. The term 
recombinant antibody or antibody fragment means antibody or 
antibody fragment which was produced using molecular biology 
techniques. The antibody or antibody fragments may be 

10 polyclonal, or preferably monoclonal. It may be specific for a 
number of epitopes associated with the streptococcus pneumoniae 
polypeptides but is preferably specific for one. 
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Without limiting its scope, the present invention also relates 
to new antigens designated BVH-3, BVH-11, BVH-11-2, BVH-2 8 and 
BVH-71. The present invention also relates to truncated 
polypeptides comprising fragments of the new antigens 
designated BVH-3, BVH-11, BVH-11-2, BVH-28 and BVH-71. The 
present invention also relates to chimeric polypeptides 
comprising fragments of the new antigens designated BVH-3, BVH- 
11, BVH-11-2, BVH-28 and BVH-71. The following is a reference 
table summarizing the relation between the antigens of the 
present invention: 
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Nucleotide SEQ ID 


Polypeptide SEQ ID 
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82 
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EXAMPLE 1 

This example illustrates the cloning of S_l. pneumoniae genes. 

5 

The coding region of pneumoniae gene BVH-3 (SEQ ID NO: 1) and 
the coding region of £L_ pneumoniae gene BVH-2 8 (SEQ ID NO: 5) 
were amplified by PCR (DNA Thermal Cycler GeneAmp PCR system 
2400 Perkin Elmer, San Jose, CA) from genomic DNA of serogroup 6 

10 pneumoniae strain SP64 using the oligos that contained base 

extensions for the addition of restriction sites Bglll (AGATCT) 
and Xbal (TCTAGA) . PCR products were purified from agarose gel 
using a QIAquick gel extraction kit from QIAgen (Chatsworth, 
CA) , digested Bglll-Xbal (Pharmacia Canada Inc, Baie d'Urfe, 

15 Canada) , extracted with phenol : chloroform and precipitated 
with ethanol. The Superlinker vector pSL301 (Invitrogen, San 
Diego, CA) was digested with Bglll and Xbal and purified from 
agarose gel using a QIAquick gel extraction kit from QIAgen 
(Chatsworth, CA) . The Bglll -Xbal genomic DNA fragments were 

20 ligated to the Bglll -Xbal pSL3 01 vector. The ligated products 
were transformed into E^ coli strain DH5a [f80 lacZ DM15 endAl 
recAl hsdR17 ( r KT m K + ) supE44 thi-ll" gyrA96 relAl D(2acZYA- 
argF)U169] (Gibco BRL, Gaithersburg, MD) according to the method 
of Simanis (Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed) , pp. 

25 109-135) . Recombinant pSL3 01 plasmids (rpSL3 01) containing 
either BVH-3 or BVH-28 gene were purified using a QIAgen kit 
(Chatsworth, CA) and DNA inserts were confirmed by nucleotide 
sequence analysis (Taq Dye Deoxy Terminator Cycle Sequencing 
kit, ABI , Foster City, CA) . Recombinant rpSL301 (rpSL301) were 

30 digested with the restriction enzymes Bglll (AGATCT) and Xhol 
(CTCGAG) . DNA fragments Bglll -Xhol were purified using the 
QIAquick gel extraction kit from QIAgen (Chatsworth, CA) . pET- 
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32c ( + ) expression vector (Novagen, Madison, WI) containing the 
thioredoxin-His-Tag sequence was digested with BamHI (GGATCC) 
and Xhol and gel extracted using the QIAquick gel extraction kit 
from QIAgen (Chatsworth, CA) . The Bglll-Xhol DNA fragments were 
5 ligated to the BamHI-XhoI pET-32c(+) vector to create the coding 
sequence for thioredoxin-His • Tag-BVH-3 or thioredoxin-His ■ Tag- 
BVH-28 fusion protein. The ligated products were transformed 
into E. coli strain DH5a [f80 lacZ DM15 endAl recAl hsdRll ( r K~ 
m K + ) supE44 thi-11" gyrA9G relAl D (lacZYA-argF) U169] (Gibco BRL, 
10 Gaithersburg, MD) according to the method of Simanis (Hanahan, 

D. DNA Cloning, 1985, D.M. Glover (ed) , pp. 109-135). 
Recombinant pET-32c(+) plasmids were purified using a QIAgen kit 
(Chatsworth, CA) and the nucleotide sequences at the fusion 
sites of thioredoxin-His-Tag and DNA insert were verified by DNA 

15 sequencing (Taq Dye Deoxy Terminator Cycle Sequencing kit, ABI, 
Foster City, CA) . 

EXAMPLE 2 

20 

This example illustrates the cloning of £L_ pneumoniae protein 
genes in CMV plasmid pCMV-GH. 

The DNA coding region of a £L_ pneumoniae protein was inserted in 
25 phase downstream of a human growth hormone (hGH) gene which was 
under the transcriptional control of the cyt omega lavirus (CMV) 
promotor in the plasmid vector pCMV-GH (Tang et al . , Nature, 
1992, 356 :152) . The CMV promotor is non functional plasmid in 

E. coli cells but active upon administration of the plasmid in 
30 eukaryotic cells. The vector also incorporated the ampicillin 

resistance gene. 
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The coding region of BVH-3 gene (SEQ ID NO: 1) and BVH-28 gene 
(SEQ ID NO: 5) were obtained from rpSL301 (see example 1) using 
restriction enzymes Bglll (AGATCT) and Xbal (TCTAGA) . The 
digested products were purified from agarose gel using the 
QIAquick gel extraction kit from QIAgen (Chatsworth, CA) . The 
pCMV-GH vector (Laboratory of Dr. Stephen A. Johnston, 
Department of Biochemistry, The University of Texas, Dallas, 
Texas) containing the human growth hormone to create fusion 
proteins was digested with Bglll and Xbal and purified from 
agarose gel using the QIAquick gel extraction kit from QIAgen 
(Chatsworth, CA) . The Bglll -Xbal DNA fragments were ligated to 
the Bglll -Xbal pCMV-GH vector to create the hGH-BVH-3 or hGH- 
BVH-28 fusion protein under the control of the CMV promoter. The 
ligated products were transformed into L. coli strain DH5a[f80 
lacZ DM15 endAl recAl hsdR17 ( r K" m K + ) supE44 thi-ir gyrA96 relAl 
D(lacZYA-argF)U169] (Gibco BRL , Gaithersburg, MD) according to 
the method of Simanis (Hanahan, D. DNA Cloning, 1985, D.M. 
Glover (ed) , pp. 109-135). The recombinant pCMV plasmids were 
purified using a QIAgen kit (QIAgen, Chatsworth, CA) . 

The coding region of BVH-11 gene (SEQ ID NO: 3) was amplified by 
PCR (DNA Thermal Cycler GeneAmp PCR system 2400 Perkin Elmer, 
San Jose, CA) from genomic DNA of serogroup 6 S^ pneumoniae 
strain SP64 using the oligos that contained base extensions for 
the addition of restriction sites Bglll (AGATCT) and Hindlll 
(AAGCTT) . The PCR product was purified from agarose gel using a 
QIAquick gel extraction kit from QIAgen (Chatsworth, CA) , 
digested with restriction enzymes (Pharmacia Canada Inc, Baie 
d'Urfe, Canada), extracted with phenol : chloroform and 
precipitated with ethanol . The pCMV-GH vector (Laboratory of 
Dr. Stephen A. Johnston, Department of Biochemistry, The 
University of Texas, Dallas, Texas) was digested with Bglll and 
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Hindi II and purified from agarose gel using the QIAquick gel 
extraction kit from QIAgen (Chatsworth, CA) . The Bglll-Hindlll 
DNA fragment was ligated to the Bglll-Hindlll pCMV-GH vector to 
create the hGH-BVH-11 fusion protein under the control of the 
5 CMV promoter. The ligated products were transformed into coli 
strain DH5a[f80 lacZ DM15 endAl recAl hsdRll ( r K" m K + ) supE44 thi- 
ll" gyrASS relAl D (lacZYA-argF) U169] (Gibco BRL, Gaithersburg, 
MD) according to the method of Simanis (Hanahan, D. DNA Cloning, 
1985, D.M. Glover (ed) , pp. 109-135). The recombinant pCMV 
10 plasmid was purified using a QIAgen kit (Chatsworth, CA) and the 
nucleotide sequence of the DNA insert was verified by DNA 
sequencing. 

15 EXAMPLE 3 

This example illustrates the use of DNA to elicit an immune 
response to pneumoniae antigens. 

20 A group of 8 female BALB/c mice (Charles River, St -Constant, 

Quebec, Canada) were immunized by intramuscular injection of 50 
(il three times at two- or three-week intervals with 100 |ug of 
recombinant pCMV-GH encoding the BVH-3, BVH-11 or the BVH-28 
gene in presence of 50 ^ig of granulocyte -macrophage colony - 

25 stimulating factor (GM-CSF) - expressing plasmid pCMV-GH-GM-CSF 
(Laboratory of Dr. Stephen A. Johnston, Department of 
Biochemistry, The University of Texas, Dallas, Texas) . As 
control, a group of mice were injected with 100 |ug of pCMV-GH in 
presence of 50 (ig of pCMV-GH-GM-CSF. Blood samples were 

30 collected from the orbital prior to each immunization and seven 
days following the third injection and serum antibody responses 
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were determined by ELISA using thioredoxin-His - Tacr-S . pneumoniae 
fusion protein as coating antigen. DNA immunization with 
recombinant plasmid pCMV-GH encoding the BVH-3, BVH-11 or the 
BVH-28 £L pneumoniae protein induced antibody reactive against 
5 the respective recombinant protein. The reciprocal antibody 
titers, defined as the highest serum dilution at which the 
absorbance values were 0.1 above the background values, were 
above 4x1 0 3 . 

10 

EXAMPLE 4 

This example illustrates the production and purification of 
recombinant Sj_ pneumoniae proteins. 

15 

The recombinant pET plasmids containing the BVH-3, BVH-11 or the 
BVH-2 8 gene corresponding to the SEQ ID NO: 1 , SEQ ID NO: 3 or 
the SEQ ID NO: 5 respectively were transformed by 
electroporation (Gene Pulser II apparatus, BIO-RAD Labs, 

20 Mississauga, Canada) into E^ coli strain AD494 (DE3) (Dara" 
leulG91 DlacX74 DphoA PvuII phoR DmalF3 F ' [lac + (lacl q ) pro] 
trxB::Kan) (Novagen, Madison, WI) . In this strain of JL_ coli , 
the T7 promotor controlling expression of the fusion protein is 
specifically recognized by the T7 RNA polymerase (present on the 

25 1DE3 prophage) whose gene is under the control of the lac 
promotor which is inducible by isopropyl-S-d-thio- 
galactopyranoside (IPTG) . The transformant AD494 (DE3) /rpET was 
grown at 37°C with agitation at 250 rpm in LB broth (peptone 
10g/L, yeast extract 5g/L, NaCl lOg/L) containing lOOjag of 

30 ampicillin (Sigma-Aldrich Canada Ltd., Oakville, Canada) per ml 
until the A 60 o reached a value of 0 . 6 . In order to induce the 
production of the thioredoxin-His • Tag-BVH- 3 , thioredoxin- 
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His -Tag-BVH-11 or thioredoxin-His • Tag-BVH-2 8 fusion protein, the 
cells were incubated for 2 additional hours in the presence of 
IPTG at a final concentration of 1 mM. Induced cells from a 
100 ml culture were pelleted by centrifugation and frozen at - 
5 70°C. 

The purification of the fusion proteins from the soluble 
cytoplasmic fraction of IPTG-induced AD494 (DE3 ) /rpET was done by 
affinity chromatography based on the properties of the His -Tag 

10 sequence (6 consecutive histidine residues) to bind to divalent 
cations (Ni 2+ ) immobilized on the His -Bind metal chelation resin. 
Briefly, the pelleted cells obtained from a lOOmL culture 
induced with IPTG were resuspended in phosphate -buf f ered 
(PBS) :500mM NaCl pH7.1, sonicated and spun at 20,000 X g for 20 

15 min to remove debris. The supernatant was filtered (0.22jum pore 
size membrane) and deposited on a HiTrap® ImL chelating pre- 
packed ready-to-use column (Pharmacia Biotech, Baie d'Urfe, 
Canada) . The thioredoxin-His -Tag-S . pneumoniae fusion protein 
was eluted with 1M imidazole- 500mM NaCl-PBS pH7 . 1 . The removal 

20 of the salt and imidazole from the sample was done by dialysis 
against PBS at 4°C. The quantities of fusion protein obtained 
from the soluble fraction of coli was estimated by MicroBCA 
(Pierce, Rockford, Illinois) . 

25 

EXAMPLE 5 

This example illustrates the protection of mice against fatal 
pneumococcal infection by immunization. 

30 

Groups of 8 female BALB/c mice (Charles River) were immunized 
subcutaneously three times at three-week intervals with either 
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25 |ig of affinity purified thioredoxin-His • Tag-BVH-3 fusion 
protein in presence of 15 jag of QuilA adjuvant (Cedarlane 
Laboratories Ltd, Hornby, Canada) or, as control, with QuilA 
adjuvant alone in PBS. Blood samples were collected from the 
5 orbital sinus on day 1, 22 and 43 prior to each immunization and 
seven days (day 50) following the third injection. One week 
later the mice were challenged with approximately 10 6 CFU of the 
type 3 S_^. pneumoniae strain WU2 . Samples of the S_^ pneumoniae 
challenge inoculum were plated on chocolate agar plates to 
10 determine the CFU and to verify the challenge dose. Deaths were 
recorded for a period of 14 days and on day 14 post-challenge, 
the surviving mice were sacrificied and blood samples tested for 
the presence of £L_ pneumoniae organisms . The survival data are 
shown in table 1. 

15 

Prechallenge sera were analyzed for the presence of antibodies 
reactive with S_l. pneumoniae by standard immunoassays. Elisa and 
immunoblot analyses indicated that immunization with recombinant 
S . pneumoniae protein produced in coli elicited antibodies 
20 reactive with both, recombinant and native pneumococcal protein. 

Table 1. Protection mediated by recombinant BVH-3 protein 



Immunogen 


No. of mice alive : no. of mice 
dead 

14 days post -challenge 


Median day of 
death 


BVH-3 


8 : 0 


>14 


none 


0 : 8 


1 



25 All mice immunized with BVH-3 recombinant protein survived to 
infection while none of the control mice given adjuvant alone 
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survived. There was a significant difference in survival 
between the two groups of mice (P< 0.0001, log rank test for 
nonparametric analysis of survival curves; P=0.0002, Fisher's 
exact test) . All hemocultures from surviving mice were negative 
5 at day 14 post-challenge. 



EXAMPLE 6 

10 

This example describes the cloning of BVH-3 and BVH-11 genes 
from a variety of pneumoniae strains and the molecular 
conservation of these genes. 

15 Molecular analysis of chromosomal DNA from various S . 

pneumoniae isolates with DNA probes spanning different regions 
of BVH-3 or BVH-11 revealed the presence of one BVH-3 gene copy 
and two BVH-11 gene copies. The two BVH-11 gene copies are not 
identical and the genes were arbitrarily designated BVH-11 (SEQ 

20 ID NO: 12; ORF at nucleotides 45 to 2567) and BVH-11-2 (SEQ ID 
NO:13; ORF at nucleotides 114 to 2630). 

The first amino acids of the BVH-3 and BVH-11 coding regions 
have the characteristics of leader sequences also known as 

25 signal peptides. The consensus signal peptidase cleavage site 
L-X-X-C of lipoprotein modification/processing sites was 
present in the sequences. Mature BVH-3, BVH-11 and BVH-11-2 
proteins from S_;_ pneumoniae SP64 have 1019, 821 and 819 amino 
acids, respectively. The regions of S_l. pneumoniae genes 

30 coding for mature BVH-3, termed BVH-3M, (nucleotides 1837 - 

4896; SEQ. ID. NO: 11), BVH-11M (nucleotides 102-2567; SEQ. ID. 
NO: 12) and BVH-11 -2M (nucleotides 171-2630; SEQ. ID. NO: 13), 
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were amplified by PCR (DNA Thermal Cycler GeneAmp PCR system 
24 0 0 Perkin Elmer, San Jose, CA) from genomic DNA of 6 or 7 S . 
pneumoniae strains. Serogroup 6 pneumoniae SP64 and 
serogroup 9 SP63 clinical isolates were provided by the 
5 laboratoire de la sante publique du Quebec, Sainte-Anne-de- 
Bellevue; serotype 4 strain JNR.7/87 was provided by Andrew 
Camilli, Tufts University School of Medicine, Boston; Rxl 
strain, a nonencapsulated derivative of the type 2 strain D3 9 
and the type 3 strains A66 and WU2 were provided by David E. 

10 Briles from University of Alabama, Birmingham and the type 3 

clinical isolate P4241 was provided by the centre de recherche 
en inf ectiologie du centre hospitalier de l'universite Laval, 
Sainte-Foy. The sets of oligonucleotide primers OCRR479- 
OCRR480; HAM Jl 6 0- OCRR4 8 8 and HAMJ160 -HAMJ186 , that contained 

15 base extensions for the addition of restriction sites were used 
for the amplification of BVH-3 , BVH-11 and BVH-11-2 gene, 
respectively, with the exception of BVH-11 gene from SP64 
strain which was amplified using the set of primers consisting 
of HAMJ487 and OCRR488. Primer sequences are listed below 

20 (Table 2) . PCR products were purified from agarose gel using 
a QIAquick gel extraction kit from QIAgen (Chatsworth, CA) and 
digested Bglll-Xbal or Bglll-Hindlll (Pharmacia Canada Inc, 
Baie d'Urfe, Canada) . Digestions were cleaned using a QIAquick 
PCR purification kit from QIAgen (Chatsworth, CA) . The PCR 

25 products were ligated to the Bglll-Xbal or Bgl I I -Hindi I I 

pSL301 vector. The ligated products were transformed into E . 

coli strain DH5oc [<|>8 0 lacZ AM15 endAl recAl hsdR17 ( r K~ m K + ) 

supE44 thi-lX' gyrA96 relAl A (lacZYA-argF) U169] (Gibco BRL, 
Gaithersburg, MD) according to the method of Simanis (Hanahan, 
30 D. DNA Cloning, 1985, D.M. Glover (ed) , pp. 109-135). 

Recombinant pSL301 plasmids (rpSL301) containing BVH-3 , BVH-11 
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or BVH11-2 were purified using a QIAgen kit (Chatsworth, CA) 
and DNA inserts were sequenced (Taq Dye Deoxy Terminator Cycle 
Sequencing kit, ABI, Foster City, CA) . The figures 11 and 12 
depict the consensus sequence established from the BVH-3 , and 
5 BVH-11 deduced amino acid sequences, respectively. Comparison 
of BVH-3 protein sequences revealed 99 to 100% identity of 
sequences for all strains with the exception that BVH-3 from 
serogroup 9 SP63 strain (SEQ. ID. NO: 15 and SEQ. ID. NO: 16) 
misses a stretch of 177 amino acids corresponding to residues 
10 244 to 420 on BVH-3 protein sequence of iL_ pneumoniae SP64. 
Analysis of sequences of additional serogroup 9 strains 
revea led BVH-3 molecule having the same deletion in 3 out of 4 
strains thus suggesting that the 3 strains are members of a 
pneumoniae serogroup 9 clone. 

15 

Comparison of 13 BVH-11 nucleotide sequences obtained from 7 
pneumoniae strains, revealed that the nucleotide sequences are 
very similar. Computer analysis (MacVector, Clustal W 1.4) 
using multiple alignment of the predicted BVH-11 protein 

20 sequences revealed that these sequences were 75% identical and 
82 % homologous on a length of 834 amino acids. Pairwise 
alignment revealed 80 to 100% identity (Figure 13) . The 
se q UenC es showed great similarity in overall organization. 
Variability in the primary sequence of these proteins is almost 

25 restricted to the last 125 amino acids in the C-terminal 
portion of the proteins. This region constitutes a domain. 
Close examination of this domain revealed two groups of 
sequences. The first 9 sequences from the figure 13 belong to 
one group while the last 4 sequences belong to another group. A 

30 3 9% identity value is obtained when the domain sequences of the 
13 proteins are compared (MacVector, Clustal W 1.4). The 
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identity value increased to more than 92% when sequences 
belonging to a same group are compared. 

5 EXAMPLE 7 

This example illustrates the homology of portions of BVH-3 and 
BVH-11 genes. 

10 Molecular analysis with DNA probes derived from BVH-3 and BVH- 
11 genes indicated that BVH-3 and BVH-11 were related. In dot 
blot hybridization studies, DNA probe consisting of either, 
BVH-3 or BVH-11, gene sequence hybridized to both, BVH-3 and 
BVH-11 genes thus indicating that BVH-3 and BVH-11 genes shared 

15 homologous sequences. Comparison of sequences revealed that 
the ORFs and the proteins were 43 and 33% identical, 
respectively. Closer examination revealed that the region 
corresponding to amino acids 1 to 225 in BVH-3 and 1 to 228 in 
BVH-11 were 73 and 75% identical at the DNA and protein level, 

20 respectively. In contrast, the 3' regions corresponding to 

amino acids 226 to 1039 from BVH-3 and amino acids 229-840 from 
BVH-11 were only 34 and 22% identical at the DNA and protein 
level, respectively. Thus the 5' termini of BVH-3 and BVH-11 
genes appear to contain highly conserved sequences while the 

25 remaining parts of the genes are highly divergent. These 
results suggest that BVH-3 and BVH-11 might share similar 
functions mediated by sequences present in the conserved region 
whereas BVH-3- and BVH-ll-specif ic functions might be mediated 
by sequences in the divergent region. 

30 

EXAMPLE 8 
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This example describes the cloning of truncated BVH-3 , BVH-11 
and BVH-11 -2 genes by polymerase chain reaction (PCR) and the 
expression of truncated BVH-3 and BVH-11 molecules. 

5 

Gene fragments were amplified by PCR using pairs of 
oligonucleotide engineered to amplify fragments spanning the 
BVH-3 (SEQ ID NO: 1 and SEQ ID NO: 11), BVH-11 (SEQ ID NO: 3 
and SEQ ID NO: 12) or BVH-11-2 (SEQ ID NO: 13) gene from 

10 pneumoniae strain SP64. Each of the primers had a restriction 
endonuclease site at the 5' end, thereby allowing directional 
in-frame cloning of the amplified product into the digested 
plasmid vector (Tables 2 and 3) . PCR-amplif ied products were 
digested with restriction endonucleases and ligated to either 

15 linearized plasmid pSL301 (see example 1) , pCMV-GH (see example 
2) or pET (Novagen, Madison, WI) expression vector digested 
likewise or digested with enzymes that produce compatible 
cohesive ends. Recombinant pSL301 and recombinant pCMV-GH 
plasmids were digested with restriction enzymes for the in- 

20 frame cloning in pET expression vector. Clones were first 

stabilized in E^_ coli DH5a before introduction into E^ coli 
BL21(A,DE3) or AD494 (AJDE3 ) for expression of truncated BVH-3 or 
BVH-11 molecules. Each of the resultant plasmid constructs was 
confirmed by nucleotide sequence analysis. The recombinant 

25 proteins were expressed as N-terminal fusions with the 

thioredoxin and His-tag or as C-terminal fusions with an His- 
tag. The expressed recombinant proteins were purified from 
supernatant fractions obtained from centrif ugation of sonicated 
IPTG-induced E^ coli cultures using a His-Bind metal chelation 

30 resin (QIAgen, Chatsworth, CA) . The gene products generated 
are listed in the table 3. The gene products corresponding to 
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the N- terminal region including the signal sequence are 
designated as Lipidated-proteins or lipoproteins (L-proteins) . 
The gene products corresponding to the N- terminal region 
lacking the signal sequence are identified as protein without 
5 signal sequence (w/o ss) . 



Table 2. List of PCR oligonucleotide primers 

10 



Primer 


SEQ. 
ID. 


Sequence 5' - 3' 


Nucleotide 
position 


Restric- 
tion 
sites 


OCRR479 


17 


cagtagatctgtgcctatgcactaaac 


SEQ ID 1 :61- 
78 


Bglll 


OCRR 480 


18 


gatctctagactactgctattccttacgctatg 


SEQ 

ID 11 :4909- 
4887 


Xbal 


OCRR497 


19 


atcactcgagcattacctggataatcctgt 


SEQ 

ID 1 :1525- 
1506 


Xhol 


UCkK 4yo 




ctgctaagcttatgaaagatttagat 


ID 1 :1534- 
1548 


TTinrlTTT 


OCRR 499 


21 


gatactcgagctgctattccttac 


SEQ 

ID 11 :4906- 
4893 


Xhol 


HAMJ 172 


22 


gaatctcgagttaagctgctgctaattc 


SEQ ID 1 : 

675-661 


Xhol 


HAMJ247 


23 


gacgctcgagcgctatgaaatcagataaattc 


SEQ ID 

1 :3 117-3096 


Xhol 


HAMJ 248 


24 


gacgctcgagggcattacctggataatcctgttcatg 


SEQ ED 

1 :1527-1501 


Xhol 


HAMJ 249 


25 


cagtagatctcttcatcatttattgaaaagagg 


SEQ ID 11 : 
1749-1771 


Bglll 


HAMJ 278 


26 


ttatttcttccatatggacttgacagaagagcaaattaag 


SEQ ID 

1 :1414-1437 


Ndel 


HAMJ 279 


27 


cgccaagcttcgctatgaaatcagataaattc 


SEQ ID 

1 :3 117-3096 


Hindlll 


HAMJ 280 


28 


cgccaagcttttccacaatataagtcgattgatt 


SEQ ID 

1 :2400-2377 


Hindlll 


HAMJ 281 


29 


ttatttcttccatatggaagtacctatcttggaaaaagaa 


SEQ ED 

1 :239 8-2421 


Ndel 


HAMJ 300 


30 


ttatttcttccatatggtgcctatgcactaaaccagc 


SEQ ID 1 :62- 
82 


Ndel 



50 



HAMJ313 


31 


ataagaatgcggccgcttccacaatataagtcgattgatt 


SEQJD 

1 :2400-2377 


NotI 


OCRR487 


32 


cagtagatctgtgcttatgaactaggtttgc 


SEQ JD 3 :58- 
79 


Bgin 


OCRR488 


33 


gatcaagcttgctgctacctttacttactctc 


SEQ ID 

12 -2577-2556 


Hindlll 


HAMJ 171 


34 


ctgagatatccgttatcgttcaaacc 


SEQ ID 

3 -1060-1075 


EcoRV 


HAMJ251 


35 


ctgcaagcttttaaaggggaataatacg 


SEQ ID 

3 :1059-1045 


Hindlll 


HAMJ 264 


36 


cagtagatctgcagaagccttcctatctg 


SEQ ID 3 :682- 
700 


Bglll 


HAMJ 282 


37 


tcgccaagcttcgttatcgttcaaaccattggg 


SEQ ID 

3 -1060-1081 


Hindlll 


HAMJ 283 


38 


ataagaatgcggccgccttactctcctttaataaagccaat 
agii 


SEQ ID 

3 -2520-2492 


Ndel 


HAMJ 284 


39 


catgccatggacattgatagtctcttgaaacagc 


SEQ ID 3 :856- 

880 


Ncol 


HAMJ 285 


40 


cgccaagcttcttactctcctttaataaagccaatag 


SEQ ID 

3 -2520-2494 


Hindlll 


HAMJ 286 


41 


cgacaagcttaacatggtcgctagcgttacc 


SEQ ID 

3 -2139-2119 


Hindlll 


HAMJ 287 


42 


cataccatgggcctttatgaggcacctaag 


SEQ ID 

3 -2014-2034 


Ncol 


HAMJ 288 


43 


cgacaagcttaagtaaatcttcagcctctctcag 


SEQ ID 

3 -2376-2353 


Hindlll 


HAMJ 289 


44 


gataccatggctagcgaccatgttcaaagaa 


SEQ ID 

3 -2125-2146 


Ncol 


HAMJ 290 


45 


cgccaagcttatcatccactaacttgactttatcac 


SEQ ID 

3 -1533-1508 


Hindlll 


HAMJ 291 


46 


cataccatggatattcttgccttcttagctccg 


SEQ ID 

3 -1531-1554 


Ncol 


HAMJ 301 


47 


catgccatggtgcttatgaactaggtttgc 


SEQ ID 3 :59- 
79 


Ncol 


HAMJ 302 


48 


cgccaagctttagcgttaccaaaaccattatc 


SEQ ID 

3 -2128-2107 


Hindlll 


HAMJ 160 


49 


gtattagatctgttcctatgaacttggtcgtcacca 


SEQ ID 13 : 
172-196 


Bgin 


HAMJ 186 


50 


cgcctctagactactgtataggagccgg 


SEQ ID 13: 
2460-2443 


Xbal 


HAMJ 292 


51 


catgccatggaaaacatttcaagccttttacgtg 


SEQ ID 11: 
754-778 


Ncol 


HAMJ 293 


52 


cgacaagcttctgtataggagccggttgactttc 


SEQ ID 11 : 
2457-2434 


Hindlll 


HAMJ 294 


53 


catgccatggttcgtaaaaataaggcagaccaag 


SEQ ID 11 : 
2038-2062 


Ncol 



51 



HAMJ297 


54 


catgccatggaagcctattggaatgggaag 


SEQID11 : 


Ncol 
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EXAMPLE 9 



This example describes the isolation of monoclonal 
antibodies (Mabs) and the use of Mabs to characterize BVH- 
5 3, BVH-11 and BVH-11-2 protein epitopes. 

Female BALB/c mice (Charles River) were immunized 
subcutaneously with BVH-3 , BVH-11 or BVH-11-2 gene products 
from £Lu pneumoniae strain SP64 in presence of 15 /ig of 

10 QuilA adjuvant (Cedarlane Laboratories Ltd, Hornby, 
Canada) . One set of mice (fusion experiment 1) were 
immunized on day 1 and 14 with 25 jug of affinity purified 
thioredoxin-His*Tag-BVH-3M fusion protein. A second group 
of mice (fusion experiment 2) were immunized three times at 

15 three-week intervals with 25 /xg of affinity purified 

thioredoxin-His»Tag-BVH-llM. A third group of mice (fusion 
experiment 3) were immunized on day 1 and day 15 with 25 /xg 
of affinity purified thioredoxin-His*Tag-BVH-ll-2M fusion 
protein. A fourth group of mice (fusion experiment 4) were 

20 immunized on day 1 with 25 fig of affinity purified 

thioredoxin-His*BVH-llB fusion protein and boosted by 
intravenous injection on day 16 and on day 3 7 with 
recombinant BVH-11B in PBS . Three to four days before 
fusion, mice were injected intravenously with 25 fig of the 

25 respective antigen suspended in PBS alone. Hybridomas were 
produced by fusion of spleen cells with nonsecreting SP2/0 
myeloma cells as previously described by J. Hamel et al . 
[J. Med. Microbiol., 23, ppl63-170 (1987)]. Culture 
supernatants of hybridomas were initially screened by 

30 enzyme-linked-immunoassay according to the procedure 

described by Hamel et al . (Supra) using plates coated with 
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preparations of purified recombinant proteins or 
suspensions of heat-killed pneumoniae cells. Positive 
hybridomas selected on the basis of ELISA reactivity with a 
variety of antigens were then cloned by limiting dilutions, 
5 expanded and frozen. 

Hybridomas were tested by ELISA or Western immunoblotting 
against BVH-3 and BVH-11 gene products in order to 
characterize the epitopes recognized by the Mabs . BVH-3 

10 and BVH-11 shared common epitopes with 6 Mabs (H3-1-F9, H3- 
1-D4, H3-1-H12, H11-1-E7, H11-1-H10 and Hll-1 . 1-G11) 
showing reactivities with both proteins (Table 4) . BVH-11 
and BVH-11-2 molecules from S^. pneumoniae SP64 shared 
common epitopes not present on BVH-3 with Mabs (3A1, 13C11, 

15 10H10, 1D8, 10G9, 10A2, 3E8, 10D7, 2H7 and 6H7) reactive 

with both, BVH-11 and BVH-11-2, recombinant proteins (Table 
5) . 



Table 4. Reactivity of BVH-3 -immunoreactive Mabs with a 
20 panel of BVH-3 and BVH-11 gene products 





a . Immunoreactivity with 


MAbs 


BVH-3M 


BVH-3 A 


BVH-3B 


BVH-3C 


NEW2 


NEW3 


BVH-11M 




21-1039 


21-509 


512-1039 


21-225 


472-800 


800-1039 


20-840 


H3-1-F9 


+ 


+ 




+ 






+ 


H3-1-D4 


+ 


+ 




+ 






+ 


H3-1-H12 


+ 


+ 




+ 






+ 


H3-2-G2 


+ 


+ 












H3-3-A1 


+ 


+ 












H3-4-D3 


+ 




+ 






+ 




H11-1-E7 


+ 


+ 




+ 






+ 


Hll-1- 


+ 


+ 










+ 
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H10 
















Hll- 

1. 1-G11 


+ 


+ 




+ 


+ 




+ 



Table 5. Reactivity of Mabs raised against BVH-11-2 
protein from £L_ pneumoniae strain SP64 with a panel of BVH- 
5 11 gene products 





b . Immunoreactivity with 


Mabs a 


c.BVH-11 products 


d. BVH-11-2 products 




i5 vll ± XIYi 




"KTTPTaT q 

JMbwy 




R VH - 1 1 - 2 

l—t V J. J. X i. ^-j 


1\| XL W ± U 


IN Hi Vil _L _L 


NTRW1 4 




20-840 


286-511 


511-713 


354-840 


20-838 


271-838 


699-838 


227-699 


3A1 


+ 


+ 




+ 


+ 


+ 




+ 


13C1 


+ 


+ 


+ 


+ 


+ 


+ 




+ 


10H10 


+ 


+ 


+ 


+ 


+ 


+ 




+ 


1D8 


+ 


+ 




+ 


+ 


+ 




+ 


10G9 


+ 






+ 


+ 


+ 




+ 


10A2 


+ 






+ 


+ 


+ 




+ 


3E8 


+ 






+ 


+ 


+ 




+ 


10D7 


+ 






+ 


+ 


+ 




+ 


2H7 


+ 








+ 








6H7 


+ 








+ 








3A4 










+ 


+ 






14H6 










+ 


+ 


+ 




7G2 










+ 


+ 




+ 


13H10 










+ 






+ 


7E8 










+ 








7H6 










+ 









Mabs listed in this table were not reactive with recombinant BVH-3 molecule 



The results obtained from the immunoreactivity studies of 
the Mabs (Table 4 and Table 5) are in agreement with the 
10 protein sequences derived from the respective gene 

sequences. Indeed the Mabs cross-reactive with BVH-3 and 
BVH-11 molecules recognized BVH-3C protein corresponding to 
the conserved region, and BVH-11 and BVH-11-2 specific Mabs 
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were reactive with epitopes located on variable parts of 
these molecules. BVH-3 and BVH-11, and BVH-11 and BVH-11-2 
can be distinguished by their reactivity with Mabs . 

5 

EXAMPLE 10 

This example illustrates the simultaneous expression of 
10 BVH-3 and BVH-11 gene products by pneumoniae . 

A standard Western blot technique was used to investigate 
whether BVH-3 and BVH-11 genes were expressed in £L_ 
pneumoniae . S . pneumoniae strain SP64 and SP63 were grown 

15 overnight at 3 7°C in 5% C0 2 on chocolate agar plates, 

bacteria were suspended in PBS and heat-killed at 56°C for 
2 0 min. For the preparation of antigens, suspensions of 
pneumoniae were treated with sample buffer containing SDS 
and 2-mercaptoethanol for 5 min at 100 °C. Pneumococcal 

20 protein antigens were resolved by SDS -PAGE electrophoresis 
according to the method of Laemmli [Nature, 227, pp. 680- 
685 (1970)]. After SDS-PAGE, the proteins were transferred 
electrophoretically from the gel to nitrocellulose paper by 
the method of Towbin [Proc. Natl. Acad. Sci . USA, 76, pp. 

25 4350-4354 (1979)] and probed with mouse antiserum or 

monoclonal antibodies. The detection of antigens reactive 
with the antibodies was performed by indirect enzyme- 
immunoassay using conjugated-anti -mouse immunoglobulins and 
a colour substrate. When antiserum raised to recombinant 

30 BVH-3 was tested against S^ pneumoniae SP64 antigens, two 
reactive bands having apparent molecular masses of 12 7 kDa 
and 99 kDa were detected. Bands having the same apparent 



58 



molecular masses were also detected when Mabs H3-1-F9, H3- 
1-D4, H3-1-H12, H11-1-E7, H11-1-H10 and Hll - 1 . 1 -Gil were 
used individually as immunological probes. In contrast, 
Mabs specific for the BVH-3 molecule detected the 12 7 kDa 
5 band only and Mabs specific for BVH-11 detected the 99 kDa 
band only thus confirming the identity of the 12 7 and 99 
kDa bands as BVH-3 and BVH-11, respectively. These studies 
provide evidence that BVH-3 and BVH-11 proteins are 
simultaneously present on pneumoniae . Moreover, the 
10 results are consistent with our previous observations that 
BVH-3 and BVH-11 possess epitopes that are common to both 
proteins and epitopes that are exclusive to either protein. 

In iL. pneumoniae SP64, mature BVH-3, BVH-11 and BVH-11-2 
15 are proteins of 1019, 821 and 819 amino acids with 

predicted molecular mass of 112.5 kDa, 92.4 kDa, and 91.7 
kDa, respectively. Although there is a discrepancy between 
the molecular mass predicted from the sequence and the 
molecular mass calculated on SDS-PAGE, BVH-3 can be 
20 distinguished from BVH-11 by its higher molecular mass. 

Moreover, BVH-3 molecules from £L_ pneumoniae strain SP63 
have an apparent molecular mass of 112 kDa in SDS-PAGE 
compared to 12 7 kDa for BVH-3 of SP64 strain. This data is 
consistent with the deletion of a stretch of 177 amino acid 
25 residues in BVH-3 of S_u pneumoniae strain SP63 . 

EXAMPLE 11 

30 This example describes the protection conferred in 

experimental infection of mice vaccinated with recombinant 
BVH-3 or BVH-11 gene products. 
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Groups of 7 or 8 female BALB/c mice (Charles River) were 
immunized subcutaneously three times at three-week 
intervals with either affinity purified thioredoxin- 
5 His»Tag-BVH-3M fusion protein, affinity purified 

thioredoxin-His»Tag-BVH-llM fusion protein or, as control, 
with QuilA adjuvant alone in PBS. Twelve to 14 days 
following the third immunization, the mice were challenged 
intravenously with pneumoniae WU2 strain or intranasally 

10 with P4241 strain. Samples of the £L_ pneumoniae challenge 
inoculum were plated on chocolate agar plates to determine 
the CFU and to verify the challenge dose. The challenge 
dose was approximately 10 6 CFU. Deaths were recorded for a 
period of 14 days and on day 14 post -challenge, the 

15 surviving mice were sacrificed and blood samples tested for 
the presence of S_i. pneumoniae organisms. The survival data 
are shown in Tables 6 and 7 . 



20 



Table 6. Protection mediated by recombinant BVH-3M and 
BVH-11M proteins in experimental infection with virulent £L_ 
pneumoniae WU2 



Experiment 


Immunogen 


Alive 


: dead 3 


Median days alive 


1 


BVH-3M 


8 : 0 




>14 




none 


0 : 8 




1 


2 


BVH-11M 


8 : 0 




>14 




none 


0 : 8 




1 



25 a The number of mice alive : the number of mice dead on day 
14 post-challenge. 
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Table 7. Protection mediated by recombinant BVH-3M and 
BVH-11M proteins in experimental pneumonia with virulent S . 
pneumoniae P4241 



Experiment 


Immunogen 


Alive 


: dead a 


Median day alive 


1 


BVH-3M 


6 


: 1 




>14 




none 


1 


: 7 




4 . 5 


2 


BVH-3M 


8 


: 0 




>14 




BVH-11M 


8 


: 0 




>14 




none 


0 


: 8 




4 



5 a The number of mice alive : the number of mice dead on day 
14 post-challenge . 

All mice immunized with recombinant BVH-3M or BVH-11M 
protein survived to infection with WU2 while none of the 

10 control mice given adjuvant alone survived. All except one 
mice immunized with recombinant BVH-3M or BVH-11M protein 
survived to infection with P4241 while only one control 
mice given adjuvant alone survived. All hemocultures from 
surviving mice were negative at day 14 post -challenge . 

15 These results clearly indicate that both, BVH-3M and BVH- 
11M, elicit protective ant i -pneumococcal immune responses 
in mice. The fact that these proteins are highly conserved 
among £L_ pneumoniae isolates emphasize the potential of 
BVH-3 and BVH-11 as universal vaccine candidates. Indeed, 

20 the BVH-3 and BVH-11 proteins from serogroup 6 S . 
pneumoniae strain SP64 elicited protection against 
pneumococcal infections with strains of different capsular 
serotypes . 

25 Ideally, a vaccine that could protect against pneumococcal 
disease, could protect against meningitis, otitis media, 
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bacteremia and pneumonia. BVH-3 and BVH-11 were protective 
against lethal systemic- and pneumonia-infection models 
thus suggesting that, in humans, BVH-3- and BVH11 -protein- 
based vaccines could reduce the incidence of a wide 
5 spectrum of disease caused by virtually all pneumoniae 
independently of the capsular serotype. 

Data from Tables 6 and 7 clearly demonstrate that BVH-3 and 
BVH-11 were, both, protection-eliciting molecules of 

10 pneumoniae . It was not known, however, whether protection 
can be mediated by specific sequences that were not shared 
on BVH-3 and BVH-11 molecules. Groups of female BALB/c 
mice (Charles River) were immunized subcutaneously three 
times at three-week intervals with either affinity purified 

15 thioredoxin-His^Tag- BVH-3 AD, -BVH-3B or -BVH-3C fusion 
protein in presence of 15 fig of QuilA adjuvant (Cedarlane 
Laboratories Ltd, Hornby, Canada) . Control mice were 
immunized with QuilA adjuvant alone in PBS or affinity 
purified thioredoxin-His*Tag or thioredoxin-His*Tag- fusion 

20 protein (His-Thio) in presence of QuilA. 

To determine the protective ability of a set of truncated 
proteins, termed NEW4, NEW5, NEW6, NEW7, NEW 8 , NEW 9 , NEW10, 
NEW11, NEW14 and BVH-11B, groups of female BALB/c mice 

25 (Charles River) were immunized subcutaneously two times at 
three-week intervals with 25 fig of either affinity purified 
His^Tag-fusion protein in presence of 15 fig of QuilA 
adjuvant. Ten to 14 days following the last immunization, 
the mice were challenged with virulent S_;_ pneumoniae . Our 

30 results indicate that, BVH-3B, a truncated BVH-3 molecule 
consisting of amino acids 512-1039, elicited protection 
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against the mouse-virulent strains WU2 and P4241. 
Similarly, BVH-11B, NEW4 and NEWS molecules, three 
truncated BVH-11 molecules consisting of amino acids 354- 
840, amino acids 286-840 and amino acids 286-713, 
5 respectively, elicited protection against experiment 

intravenous challenge with WU2 and intranasal challenge 
with P4241. Moreover, vaccination with NEW10 and NEW14, 
consisting of amino acids 272-838 and amino acids 227-699 
from BVH-11-2 molecule also resulted in protection against 

10 death with the pneumococcal strains. These results 
indicate that the region comprising 42 8 amino acids 
extending from amino acids 286-713 and amino acids 272-699 
on pneumoniae SP64 BVH-11 and BVH-11-2 protein 
sequences, respectively, contains protective epitopes. 

15 This region is highly conserved with a global 91% identity 
and 94% homology among thirteen BVH-11 protein sequences. 
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Table 8. Evaluation of protection elicited by vaccination 
of mice with BVH-3 and BVH-11 gene products 





Challenge with WU2 


Challenge with P4241 


Experiment 


Immunogen 


Alive 


: dead 8 


Median day 


Alive 


: dead 


Median day 










alive 






alive 


i b 


None 


0 : 


8 


1.5 


1 : 


7 


4.5 




NEW4 


8 : 


0 


>14 


8 : 


0 


>14 




NEW 5 


8 : 


0 


>14 


8 : 


0 


>14 




NEW 7 


0 : 


8 


2 


0 : 


8 


5 




BVH-11M 


8 : 


0 


>14 


8 : 


0 


>14 


2 b 


None 


0 : 


8 


1 


0 : 


8 


4 




NEW 5 


8 : 


0 


>14 


8 : 


0 


>14 




NEW 8 


0 : 


8 


1.5 


0 : 


8 


5.5 




NEW 9 


3 : 


5 


3.5 


2 : 


6 


7 




BVH-11M 


8 : 


0 


>14 


8 : 


0 


>14 


3 h 


None 


0 : 


8 


1 


0 : 


8 


4 




NEW 6 


0 : 


8 


1 


4 : 


4 


10. 5 C 




NEW10 


8 : 


0 


>14 


8 : 


0 


>14 




NEW11 


0 : 


8 


1.5 


1 : 


7 


6 




BVH-11M 


8 : 


0 


>14 


8 : 


0 


>14 


4 b 


None 


0 : 


8 


2 


0 : 


8 


4 




BVH-11B 


7 : 


1 


>14 


8 : 


0 


>14 




NEW 14 


8 : 


0 


>14 


8 : 


0 


>14 


5 


His-Thio 


0 : 


8 


2 










BVH-3 AD 


1 : 


7 


2.5 










BVH-3B 


5 : 


3 


>14 








6 


His-Thio 


0 : 


8 


1 










BVH-3C 


0 : 


8 


1 









a The number of mice alive 

14 post-challenge . 
b The WU2 challenge dose was 10 5 CFU. 



the number of mice dead on day 
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c Mice living longer than 14 days were assigned a survival 
time of 14 days for the determination of median values. 

5 

EXAMPLE 12 

This example described the cloning and expression of a 
chimeric gene encoding for a chimeric polypeptide 
10 corresponding to the carboxy- terminal region of BVH-3 in 

fusion at the C end to the carboxy- terminal region of BVH- 
11 and the additive protection observed after vaccination 
with a chimeric polypeptide. 

15 It is clear from the studies described above that BVH-3 and 
BVH-11 are serologically distinct molecules simultaneously 
present on pneumoniae . The results of immunological 
studies of mice indicate that both proteins are good 
vaccine candidates. These proteins have the potential to 

20 provide protection against all pneumococci, regardless of 
serotype. Even though the two proteins share epitopes and 
sequences, they have different characteristics and may 
serve different biological functions. Thus 7 immunization 
against the two proteins may provide a higher level of 

25 protection than that imparted by each individually. To 
examine this, several avenues where full-length or 
truncated BVH-3 and BVH-11 are administered in combination 
or in conjugation can be explored. Here we describe the 
genetic engineering of a BVH-3 -BVH-11 fusion gene and 

30 protein, termed NEW12 (SEQ ID NO:76 and SEQ ID NO:58, 

respectively) , and the potential use of NEW 12 protein as a 
vaccine . 
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BVH-3 and BVH-11 gene fragments corresponding to the 3 ' end 
of the genes were amplified by PCR using pairs of 
oligonucleotides engineered to amplify fragments spanning 
5 nucleotides 1414 to 3117 (SEQ ID NO: 1) and nucleotides 1060 
to 2520 (SEQ ID NO: 3) from pneumoniae strain SP64 BVH-3 
and BVH-11 genes, respectively. The primers used, HAMJ278 
and HAM J2 7 9 ; HAMJ282 and HAMJ283 had a restriction 
endonuclease site at the 5' end, thereby allowing 

10 directional in- frame cloning of the amplified product into 
the digested pET21b(+) plasmid vector (Table 2) . PCR- 
amplified products were digested with restriction 
endonucleases and ligated to linearized plasmid pET21b(+) 
vector digested likewise. The resultant plasmid constructs 

15 were confirmed by nucleotide sequence analysis. The 

recombinant pET21b(+) plasmid containing the Ndel-Hindlll 
BVH-3 PCR product was linearized by digestion with the 
restriction enzymes Hindi I I and Not I for the in- frame 
cloning of the Hindi I I -Not I DNA fragment obtained from the 

20 recombinant pET21(+) vector containing the BVH-11 gene 
fragment. Clones were first stabilized in EL_ coli DH5a 
before introduction into EL. coli BL21(ADE3) for expression 
of a chimeric pneumococcal protein molecule. The 
recombinant chimeric polypeptide, termed NEW 12, was 

25 expressed as C- terminal fusion with an His -tag. The 

expressed recombinant NEW 12 protein was purified from 
supernatant fractions obtained from centrif ugation of 
sonicated IPTG-induced |L_ coli cultures using a His-Bind 
metal chelation resin (QIAgen, Chatsworth, CA) . 



66 



According to the same procedure described above, it is 
possible to construct other chimeric polypeptides, as a 
result of a simultaneous expression of New 1 and New 4, New 
1 and New 5, New 1 and New 10, or New 1 and New 14. The 
5 construction can be with New 1 upstream or downstream of 
New 4, New 5, New 10, BVH-11B or New 14. It is also 
possible to construct other chimeric polypeptides as a 
result of a simultaneous expression of more than two 
fragments of either genes of BVH-3, BVH-11 or BVH-11-2. 

10 

Groups of 8 female BALB/c mice (Charles River) were 
immunized subcutaneously two times at three -week intervals 
with 25 fig of either affinity purified His«Tag-f usion NEW1, 
BVH-11B or NEW 12 protein in presence of 15 fig of QuilA 

15 adjuvant. Ten to 14 days following the last immunization, 
the mice were challenged with virulent pneumoniae . As 
demonstrated before, NEW1 and BVH-11B molecules comprising 
amino acids 472 to 103 9 from BVH-3 protein and amino acids 
354-840 from BVH-11 protein, respectively, correspond to 

20 portions of the proteins capable of eliciting a protective 
immune response. To determine if a chimeric polypeptide 
would significantly improve the protection compared with 
those seen for the individual counterparts, the challenge 
dose was adjusted in a manner that protection was not 

25 expected with NEW1 and BVH-11B molecules. Interestingly, 

the chimeric NEW 12 protein, elicited protection against the 
mouse -virulent strains WU2 and P4241. Seven out of 8 mice 
immunized with NEW 12 were still alive 10 days after the 
challenge while 2 8 out of 32 mice immunized with NEW1, BVH- 

30 11B, BVH-3M or adjuvant alone were dead by five days post- 
challenge. Thus, vaccination of mice with NEW12 provided 
the highest degree of protection against WU2 challenge. 
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These results indicate that immunization with a chimeric 
polypeptide and possibly a combination of BVH-3 and BVH-11 
gene products can provide additional protection to that 
obtained by administration of BVH-3 or BVH-11 antigens 
5 alone. 



Table 9. Evaluation of protection elicited by vaccination 
of mice with the chimeric NEW 12 molecule 





Challenge with WU2 


Challenge with P4241 


Immunogen 


Alive : dead a 


Median day 


Alive : dead- 


Median day 






alive 




alive 


None 


0 : 8 


l 


0 : 8 


5 


NEW1 


2 : 6 


2 


1 : 7 


8 


BVH-11B 


1 : 7 


3 . 5 


8 : 0 


>14 


NEW12 


6 : 2 


>14 


7 : 1 


>14 


BVH-3M 


1 : 7 


3 


8 : 1 


>14 



10 

EXAMPLE 13 

This example illustrates the identification of additional 
15 BVH-3 and BVH-11 related sequences in Streptococcus species 
other than S_;_ pneumoniae . 

It was previously shown that BVH-3 , BVH-11 and BVH-11-2 are 
a family of related proteins sharing common sequences. 
20 Homology searches were performed with the nucleotide 
sequence from the conserved region of these genes and 
compared with GenBank and EMBL sequences using FASTA. The 
most significant homology was observed with a 2.469-kb gene 
coding for a calculated 92-kDa protein (SEQ ID NO: 81) of 
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unknown function in S^ agalactiae also called group B 
streptococcus or GBS. The gene was designated BVH-71. A 
protein demonstrating 99.2% identity and 99.5% similarity 
with that of GBS was also identified in S_;_ pyogenes also 
5 called group A streptococcus or GAS (SEQ ID NO: 83) . The 5' 
region of the BVH-71 sequences (SEQ ID NO: 80 and SEQ ID 
NO: 82) , spanning nucleotides 1 to 717, demonstrated 58 and 
60% identity with the conserved regions of BVH-3 
(nucleotides 1 to 675) and BVH-11 (nucleotides 1 to 684) 

10 genes respectively. The first 239 amino acids of the 

translated sequences of the GBS and GAS BVH-71 open reading 
frames are 51 and 54% identical to the first 225 and 22 8 
amino acids of BVH-3 and BVH-11, respectively. In addition 
to structural similarities, streptococcal BVH-3, BVH-11 and 

15 BVH-71 proteins also share antigenic epitopes. A 97-kDa 
band was revealed on Western blots of GAS or GBS whole 
cells, using Mab Hll-l.l-Gll reactive with the BVH-3 and 
BVH-11 conserved regions. Similarly, GAS and GBS 
recombinant BVH-71 proteins were detected in Western 

20 immunoblot analysis. 

These results indicate that BVH-71, BVH-3 and BVH-11 
proteins might share similar functions. Our results also 
suggest that BVH-71 proteins can be used as protein vaccine 
components of anti -streptococcus . In a further embodiment 

25 BVH-71 proteins can be used as protein vaccine components 
of anti -GAS or anti -GBS vaccines. 
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What is claimed is: 

1. An isolated polynucleotide encoding a polypeptide having 
at least 70% identity to a second polypeptide having a 
sequence chosen from: SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 
55 to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 

2. A polynucleotide according to claim 1, wherein said 
polynucleotide encodes a polypeptide having at least 95% 
identity to the second polypeptide. 

3. An isolated polynucleotide encoding a polypeptide capable 
of generating antibodies having binding specificity for a 
polypeptide having a sequence chosen from: SEQ ID NOs: 2, 
4, 6, 8, 10, 14, 16, 55 to 75, 77 to 79, 81, 83 or 
fragments, analogs or derivatives thereof . 

4. An isolated polynucleotide that is complementary to the 
polynucleotide of claim 1. 

5. An isolated polynucleotide that is complementary to the 
polynucleotide of claim 3. 

6. The polynucleotide of claim 1, wherein said 
polynucleotide is DNA. 

7. The polynucleotide of claim 3, wherein said 
polynucleotide is DNA. 

8. The polynucleotide of claim 1, wherein said 
polynucleotide is RNA. 
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9. The polynucleotide of claim 3, wherein said 
polynucleotide is RNA. 

10. A vector comprising the polynucleotide of claim 1, 
wherein said DNA is operably linked to an expression 
control region. 

11. A vector comprising the polynucleotide of claim 3, 
wherein said DNA is operably linked to an expression 
control region. 

12. A host cell transfected with the vector of claim 10. 

13. A host cell transfected with the vector of claim 11. 

14. A process for producing a polypeptide comprising 
culturing a host cell according to claim 12 under 
conditions suitable for expression of said polypeptide. 

15. A process for producing a polypeptide comprising 
culturing a host cell according to claim 13 under 
condition suitable for expression of said polypeptide. 

16. An isolated polypeptide having at least 70% identity to a 
second polypeptide having an amino acid sequence chosen 
from: SEQ ID NOs : 2, 4, 6, 8, 10, 14, 16, 55 to 75, 77 
to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

17. An isolated polypeptide capable of generating antibodies 
having binding specificity for a second polypeptide 
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having a sequence chosen from: SEQ ID NOs: 2, 4, 6, 8, 
10, 14, 16, 55 to 75, 77 to 79, 81, 83 or fragments, 
analogs or derivatives thereof. 

18. An isolated polypeptide having an amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 
77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

19. An isolated polypeptide according to claim 18, wherein the 
N- terminal Met residue is deleted. 

20. An isolated polypeptide according to claim 18, wherein the 
secretory amino acid sequence is deleted. 

21. A chimeric polypeptide comprising two or more 
polypeptides chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 
16, 55 to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; provided that the polypeptides or 
fragments, analogs or derivatives thereof are linked as 
to form a chimeric polypeptide. 

22. A chimeric polypeptide comprising two or more polypeptides 

chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 69, 

72, 74, 77 or fragments, analogs or derivatives thereof; 
provided that the polypeptides or fragments, analogs or 
derivatives thereof are linked as to form a chimeric 
polypeptide . 

23. A chimeric polypeptide of formula (I) : 
A- (B) m - (C) a -D (I) 

Wherein; 
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m is 0 or 1, 
n is 0 or 1, 

A is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 

to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; 

B is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 

to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; 

C is chosen from SEQ ID NOs : 2, 4, 6, 8, 10, 14, 16, 55 

to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; and 

D is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof . 

A chimeric polypeptide of formula (I) : 

A- (B) n - (C)a-D (I) 

Wherein; 

m is 0 or 1, 

n is 0 or 1, 

A is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; 

B is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77, or fragments, analogs or derivatives 
thereof; 

C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; and 

D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof . 
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A vaccine composition comprising a polypeptide according 
to any one of claims 16 to 24 and a pharmaceutical ly 
acceptable carrier, diluent or adjuvant. 

A method for therapeutic or prophylactic treatment of 
meningitis, otitis media, bacteremia or pneumonia 
infection in an individual susceptible to meningitis, 
otitis media, bacteremia or pneumonia infection comprising 
administering to said individual a therapeutic or 
prophylactic amount of a composition according to claim 
25. 

A method for therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an individual 
susceptible to streptococcal infection comprising 
administering to said individual a therapeutic or 
prophylactic amount of a composition according to claim 
25. 

A method according to claim 26, wherein said individual is 
a mammal . 

A method according to claim 27, wherein said individual is 
a human. 

A method according to claim 22, wherein said bacterial 
infection is S .pneumoniae , group A streptococcus 
(pyogenes) , group B streptococcus (GBS or agalactiae) , 
dysgalactiae, uteris, nocardia or Staphylococcus aureus. 



74 



A method according to claim 26, wherein said bacterial 
infection is S .pneumoniae . 
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ABSTRACT 



Streptococcus proteins and polynucleotides encoding them 
are disclosed. Said proteins are antigenic and therefore 
useful vaccine components for the prophylaxis or therapy . 
streptococcus infection in animals. Also disclosed are 
recombinant methods of producing the protein antigens as 
well as diagnostic assays for detecting streptococcus 
bacterial infection . 



ATGAAATTTA GTAAAAAATA TATAGCAGCT GGATCAGCTG TTATCGTATC CTTGAGTCTA 60 

TGTGCCTATG CACTAAACCA GCATCGTTCG CAGGAAAATA AGGACAATAA TCGTGTCTCT 12 0 

TATGTGGATG GCAGCCAGTC AAGTCAGAAA AGTGAAAACT TGACACCAGA CCAGGTTAGC 180 

CAGAAAGAAG GAATTCAGGC TGAGCAAATT GTAATCAAAA TTACAGATCA GGGCTATGTA 240 

ACGTCACACG GTGACCACTA TCATTACTAT AATGGGAAAG TTCCTTATGA TGCCCTCTTT 300 

AGTGAAGAAC TCTTGATGAA GGATCCAAAC TATCAACTTA AAGACGCTGA TATTGTCAAT 360 

GAAGTCAAGG GTGGTTATAT CATCAAGGTC GATGGAAAAT ATTATGTCTA CCTGAAAGAT 42 0 

GCAGCTCATG CTGATAATGT TCGAACTAAA GATGAAATCA ATCGTCAAAA ACAAGAACAT 480 

GTCAAAGATA ATGAGAAGGT TAACTCTAAT GTTGCTGTAG CAAGGTCTCA GGGACGATAT 54 0 

ACGACAAATG ATGGTTATGT CTTTAATCCA GCTGATATTA TCGAAGATAC GGGTAATGCT 600 

TATATCGTTC CTCATGGAGG TCACTATCAC TACATTCCCA AAAGCGATTT ATCTGCTAGT 660 

GAATTAGCAG CAGCTAAAGC ACATCTGGCT GGAAAAAATA TGCAACCGAG TCAGTTAAGC 72 0 

TATTCTTCAA CAGCTAGTGA CAATAACACG CAATCTGTAG CAAAAGGATC AACTAGCAAG 7 80 

CCAGCAAATA AATCTGAAAA TCTCCAGAGT CTTTTGAAGG AACTCTATGA TTCACCTAGC 840 

GCCCAACGTT ACAGTGAATC AGATGGCCTG GTCTTTGACC CTGCTAAGAT TATCAGTCGT 900 

ACACCAAATG GAGTTGCGAT TCCGCATGGC GACCATTACC ACTTTATTCC TTACAGCAAG 960 

CTTTCTGCTT TAGAAGAAAA GATTGCCAGA ATGGTGCCTA TCAGTGGAAC TGGTTCTACA 102 0 

GTTTCTACAA ATGCAAAACC TAATGAAGTA GTGTCTAGTC TAGGCAGTCT TTCAAGCAAT 10 80 

CCTTCTTCTT TAACGACAAG TAAGGAGCTC TCTTCAGCAT CTGATGGTTA TATTTTTAAT 114 0 

C C AAAAG AT A TCGTTGAAGA AACGGCTACA GCTTATATTG TAAGACATGG TGATCATTTC 12 0 0 

CATTAGATTC CAAAATCAAA TCAAATTGGG CAACCGACTC TTCCAAACAA TAGTCTAGCA 1260 

ACACCTTCTC CATCTCTTCC AATCAATCCA GGAACTTCAC ATGAGAAACA TGAAGAAGAT 132 0 

GGATACGGAT TTGATGCTAA TCGTATTATC GCTGAAGATG AATCAGGTTT TGTCATGAGT 13 80 

CACGGAGACC ACAATCATTA TTTCTTCAAG AAGGACTTGA CAGAAGAGCA AATTAAGGCT 144 0 

GCGCAAAAAC ATTTAGAGGA AGTTAAAACT AGTCATAATG GATTAG ATT C TTTGTCATCT 1500 

CATGAACAGG ATTATCCAGG TAATGCCAAA GAAATGAAAG ATTTAGATAA AAAAATCGAA 156 0 

GAAAAAATTG CTGGCATTAT GAAACAATAT GGTGTCAAAC GTGAAAGTAT TGTCGTGAAT 162 0 

AAAGAAAAAA ATGCGATTAT TTATCCGCAT GGAGATCACC ATCATGCAGA TCCGATTGAT 16 80 

GAACATAAAC CGGTTGGAAT TGGTCATTCT CACAGTAACT ATGAACTGTT TAAACCCGAA 1740 

GAAGGAGTTG CTAAAAAAGA AGGGAATAAA GTTTATACTG GAGAAGAATT AACGAATGTT 1800 

GTTAATTTGT TAAAAAATAG TACGTTTAAT AATCAAAACT TTACTCTAGC CAATGGTCAA 1860 

AAACGCGTTT CTTTTAGTTT TCCGCCTGAA TTGGAGAAAA AATTAGGTAT CAATATGCTA 192 0 

GTAAAATTAA TAACACCAGA TGGAAAAGTA TTGGAGAAAG TATCTGGTAA AGTATTTGGA 1980 

GAAGGAGTAG GGAATATTGC AAACTTTGAA TTAGATCAAC CTTATTTACC AGGACAAACA 2 04 0 

TTTAAGTATA CTATCGCTTC AAAAGATTAT CCAGAAGTAA GTTATGATGG TACATTTACA 2100 

GTTCCAACCT CTTTAGCTTA CAAAATGGCC AGTCAAACGA TTTTCTATCC TTTCCATGCA 2160 

GGGGATACTT ATTTAAGAGT GAACCCTCAA TTTGCAGTGC CTAAAGGAAC TGATGCTTTA 222 0 

GTCAGAGTGT TTGATGAATT TCATGGAAAT GCTTATTTAG AAAATAACTA TAAAGTTGGT 22 8 0 

GAAATCAAAT TACCGATTCC GAAATTAAAC CAAGGAACAA CCAGAACGGC CGGAAATAAA 2340 

ATTCCTGTAA CCTTCATGGC AAATGCTTAT TTGGACAATC AATCGACTTA TATTGTGGAA 24 0 0 

GTACCTATCT TGGAAAAAGA AAATCAAACT GATAAACCAA GTATTCTACC ACAATTTAAA 2460 

AGGAATAAAG CACAAGAAAA CTCAAAACTT GATGAAAAGG TAG AAGAAC C AAAGACTAGT 252 0 

GAGAAGGTAG AAAAAGAAAA ACTTTCTGAA ACTGGGAATA GTACTAGTAA TTCAACGTTA 2 5 80 

GAAGAAGTTC CTACAGTGGA TCCTGTACAA GAAAAAGTAG CAAAATTTGC TGAAAGTTAT 2640 

GGGATGAAGC TAGAAAATGT CTTGTTTAAT ATGGACGGAA CAATTGAATT ATATTTACCA 2 700 

TCAGGAGAAG TCATTAAAAA GAATATGGCA GATTTTACAG GAGAAGC AC C TCAAGGAAAT 2760 

GGTGAAAATA AACCATCTGA AAATGGAAAA GTATCTACTG GAACAGTTGA GAACCAACCA 2 82 0 

ACAGAAAATA AACCAGCAGA TTCTTTACCA GAGGCACCAA ACGAAAAACC TGTAAAACCA 2 880 

GAAAACTCAA CGGATAATGG AATGTTGAAT CCAGAAGGGA ATGTGGGGAG TGACCCTATG 2 940 

TTAGATCCAG CATTAGAGGA AGCTCCAGCA GTAGATCCTG TACAAGAAAA ATTAGAAAAA 3 00 0 

TTTACAGCTA GTTACGGATT AGGCTTAGAT AGTGTTATAT TCAATATGGA TGGAACGATT 3 060 

GAATTAAGAT TGCCAAGTGG AGAAGTGATA AAAAAGAATT TATCTGATTT CATAGCGTAA 312 0 
(SEQ ID NO: 1) FIGURE 1 



MKFSKKYIAA GSAVIVSLSL CAYALNQHRS 
SENLTPDQVS QKEGIQAEQI VIKITDQGYV 
SEELLMKDPN YQLKDADIVN EVKGGYIIKV 
DEINRQKQEH VKDNEKVNSN VAVARSQGRY 
YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA 
QSVAKGSTSK PANKSENLQS LLKELYDSPS 
TPNGVAIPHG DHYHFIPYSK LSALEEKIAR 
VSSLGSLSSN PSSLTTSKEL SSASDGYIFN 
HYIPKSNQIG QPTLPNNSLA TPSPSLPINP 
AEDESGFVMS HGDHNHYFFK KDLTEEQIKA 
HEQDYPGNAK EMKDLDKKIE EKIAGIMKQY 
GDHHHADPID EHKPVGIGHS HSNYELFKPE 
WLLKNSTFN NQNFTLANGQ KRVSFSFPPE 
LEKVSGKVFG EGVGNIANFE LDQPYLPGQT 
VPTSLAYKMA SQTIFYPFHA GDTYLRVNPQ 
AYLENNYKVG EIKLPIPKLN QGTTRTAGNK 
VPILEKENQT DKPSILPQFK RNKAQENSKL 
TGNSTSNSTL EEVPTVDPVQ EKVAKFAESY 
SGEVIKKMMA DFTGEAPQGN GENKPSENGK 
EAPNEKPVKP ENSTDNGMLN PEGNVGSDPM 
FTASYGLGLD SVIFNMDGTI ELRLPSGEVI 



QENKDNNRVS 


YVDGSQSSQK 


5 0 


TSHGDHYHYY 


NGKVPYDALF 


100 


DGKYYVYLKD 


AAHADNVRTK 


150 


TTNDGYVFNP 


ADIIEDTGNA 


200 


GKNMQPSQLS 


YS S TASD NIT T 


OCA 


AQRYSESDGL 


VFDPAKIISR 


300 


MVPISGTGST 


VSTNAKPNEV 


■5 r 

350 


PKDIVEETAT 


AYIVRHGDHF 


A A C\ 

40 0 


GTSHEKHEED 


GYGFDANRII 


A C f\ 

45 0 


AQKHLEEVKT 


SHNGLDSLSS 


500 


GVKRESIWN 


KEKNAIIYPH 


c C f\ 

550 


EGVAKKEGNK 


VYTGEELTNV 


600 


LEKKLGINML 


VKLITPDGKV 


650 


FKYTIASKDY 


PEVSYDGTFT 


700 


FAVPKGTDAL 


VRVFDEFHGN 


750 


IPVTFMANAY 


LDNQSTYIVE 


800 


DEKVEEPKTS 


EKVEKEKLSE 


850 


GMKLENVLFN 


MDGTIELYLP 


900 


VSTGTVENQP 


TENKPADSLP 


950 


LDPALEEAPA 


VDPVQEKLEK 


1000 


KKNLSDFIA 


(SEQ ID NO: 2) 
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ATGAAAATCA ATAAAAAATA TCTAGCTGGG TCAGTAGCTA CACTTGTTTT AAGTGTCTGT 60 

GCTTATGAAC TAGGTTTGCA TCAAGCTCAA ACTGTAAAAG AAAATAATCG TGTTTCCTAT 12 0 

ATAGATGGAA AACAAGCGAC GCAAAAAACG GAGAATTTGA CTCCTGATGA GGTTAGCAAG 18 0 

CGTGAAGGAA TCAACGCCGA ACAAATCGTC ATCAAGATTA CGGATCAAGG TTATGTGACC 240 

TCTCATGGAG AC CATTATCA TTACTATAAT GGCAAGGTCC CTTATGATGC CATCATCAGT 300 

GAAGAGCTCC TCATGAAAGA TCCGAATTAT CAGTTGAAGG ATTCAGACAT TGTCAATGAA 360 

ATCAAGGGTG GTTATGTCAT TAAGGTAAAC GGTAAATACT ATGTTTACCT TAAGGATGCA 42 0 

GCTCATGCGG ATAATGTCCG TACAAAAGAA GAAATCAATC GGCAAAAACA AGAACATAGT 4 80 

CAGCATCGTG AAGGAGGGAC TTCAGCAAAC GATGGTGCGG TAGCCTTTGC ACGTTCACAG 54 0 

GGACGCTACA CCACAGATGA TGGTTATATC TTCAATGCAT CTGATATCAT CGAAGATACG 600 

GGCGATGCCT ATATCGTTCC TCATGGAGAT CATTACCATT ACATTCCTAA GAATGAGTTA 660 

TCAGCTAGCG AGTTGGCTGC TGCAGAAGCC TTCCTATCTG GTCGGGAAAA TCTGTCAAAT 72 0 

TTAAGAACCT ATCGCCGACA AAATAGCGAT AACACTCCAA GAACAAACTG GGTACCTTCT 780 

GTAAGCAATC CAGGAACTAC AAATACTAAC ACAAGCAACA ACAGCAACAC TAACAGTCAA 84 0 

GCAAGTCAAA GTAATGACAT TGATAGTCTC TTGAAACAGC TCTACAAACT GCCTTTGAGT 900 

CAACGCCATG TAGAATCTGA TGGCCTTATT TTCGACCCAG CGCAAATCAC AAGTCGAACC 960 

GCCAGAGGTG TAGCTGTCCC TCATGGTAAC CATTACCACT TTATCCCTTA TGAACAAATG 102 0 

TCTGAATTGG AAAAACGAAT TGCTCGTATT ATTCCCCTTC GTTATCGTTC AAACCATTGG 1080 

GTACCAGATT CAAGACCAGA AGAACCAAGT CCACAACCGA CTCCAGAACC TAGTCCAAGT 114 0 

CCGCAACCTG CACCAAATCC TCAACCAGCT CCAAGCAATC CAATTGATGA GAAATTGGTC 12 00 

AAAGAAGCTG TTCGAAAAGT AGGCGATGGT TATGTCTTTG AGGAGAATGG AGTTTCTCGT 12 6 0 

TATATCCCAG CCAAGAATCT TTCAGCAGAA ACAGCAGCAG GCATTGATAG CAAACTGGCC 132 0 

AAGCAGGAAA GTTTATCTCA TAAGCTAGGA GCTAAGAAAA CTGACCTCCC ATCTAGTGAT 13 8 0 

CGAGAATTTT ACAATAAGGC TTATGACTTA CTAGCAAGAA TTCAC CAAGA TTTACTTGAT 1440 

AATAAAGGTC GACAAGTTGA TTTTGAGGCT TTGGATAACC TGTTGGAACG ACTCAAGGAT 15 0 0 

GTCTCAAGTG ATAAAGTCAA GTTAGTGGAT GATATTCTTG CCTTCTTAGC TCCGATTCGT 1560 

CATCCAGAAC GTTTAGGAAA ACCAAATGCG CAAATTACCT ACACTGATGA TGAGATTCAA 162 0 

GTAGCCAAGT TGGCAGGCAA GTACACAACA GAAGACGGTT ATATCTTTGA TCCTCGTGAT 1680 

ATAACCAGTG ATGAGGGGGA TGCCTATGTA ACTCCACATA TGACCCATAG CCACTGGATT 174 0 

AAAAAAGATA GTTTGTCTGA AGCTGAGAGA GCGGCAGCCC AGGCTTATGC TAAAGAGAAA 180 0 

GGTTTGACCC CTCCTTCGAC AGACCATCAG GATTCAGGAA ATACTGAGGC AAAAGGAGCA 1860 

GAAGCTATCT ACAACCGCGT GAAAGCAGCT AAGAAGGTGC CACTTGATCG TATGCCTTAC 192 0 

AATCTTCAAT ATAC TGTAGA AGTCAAAAAC GGTAGTTTAA TCATACCTCA TTATGACCAT 1980 

TACCATAACA TCAAATTTGA GTGGTTTGAC GAAGGCCTTT ATGAGGCACC TAAGGGGTAT 2 04 0 

ACTCTTGAGG ATCTTTTGGC GACTGTCAAG TACTATGTCG AACATCCAAA CGAACGTCCG 2100 

CATTCAGATA ATGGTTTTGG TAACGCTAGC GACCATGTTC AAAGAAACAA AAATGGTCAA 2160 

GCTGATACCA ATCAAACGGA AAAACCAAGC GAGGAGAAAC CTCAGACAGA AAAACCTGAG 222 0 

GAAGAAACCC CTCGAGAAGA GAAACCACAA AGCGAGAAAC CAGAGTCTCC AAAAC CAAC A 22 8 0 

GAGGAACCAG AAGAAGAATC AC C AGAGGAA TCAGAAGAAC CTCAGGTCGA GACTGAAAAG 2 34 0 

GTTGAAGAAA AACTGAGAGA GGCTGAAGAT TTACTTGGAA AAATCCAGGA TCCAATTATC 240 0 

AAGTCCAATG C C AAAG AG AC TCTCACAGGA TTAAAAAATA ATTTACTATT TGGCACCCAG 2460 

GACAACAATA CTATTATGGC AGAAGCTGAA AAACTATTGG CTTTATTAAA GGAGAGTAAG 252 0 

TAA (SEQ ID NO: 3) 2523 
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MKINKKYLAG SVATLVLSVC 
ENLTPDEVSK REGINAEQIV 
EELLMKDPNY QLKDSDIVNE 
EINRQKQEHS QHREGGTSAN 
GDAYIVPHGD HYHYIPKNEL 
NTPRTNWVPS VSNPGTTNTN 
QRHVESDGLI FDPAQITSRT 
IPLRYRSNHW VPDSRPEEPS 
KEAVRKVGDG YVFEENGVSR 
AKKTDLPSSD REFYNKAYDL 
VSSDKVKLVD DILAFLAPIR 
EDGYIFDPRD ITSDEGDAYV 
GLTPPSTDHQ DSGNTEAKGA 
GSLIIPHYDH YHNIKFEWFD 
HSDNGFGNAS DHVQRNKNGQ 
SEKPESPKPT EEPEEESPEE 
KSNAKETLTG LKNNLLFGTQ 



AYELGLHQAQ TVKENNRVSY 
IKITDQGYVT SHGDHYHYYN 
IKGGYVIKVN GKYYVYLKDA 
DGAVAFARSQ GRYTTDDGYI 
SASELAAAEA FLSGRENLSN 
TSNNSNTNSQ ASQSNDIDSL 
ARGVAVPHGN HYHFIPYEQM 
PQPTPEPSPS PQPAPNPQPA 
YIPAKNLSAE TAAGIDSKLA 
LARIHQDLLD NKGRQVDFEA 
HPERLGKPNA QITYTDDEIQ 
TPHMTHSHWI KKDSLSEAER 
EAIYNRVKAA KKVPLDRMPY 
EGLYEAPKGY TLEDLLATVK 
ADTNQTEKPS EEKPQTEKPE 
SEEPQVETEK VEEKLREAED 
DNNTIMAEAE KLLALLKESK 

FIGURE 4 
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DUU 


NLQYTVEVKN 


650 


YYVEHPNERP 


700 


EETPREEKPQ 


750 


LLGKIQDPII 


800 


(SEQ ID NO: 4) 
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ATGGAGAATA TAGACATGTT TAAATCAAAT CATGAGCGAA GAATGCGTTA TTCCATTCGT 60 

AAATTTAGTG TAGGAGTAGC TAGCGTAGCT GTTGCCAGTC TTTTTATGGG AAGTGTTGTA 12 0 

CATGCGACAG AGAAAGAGGG AAGTACCCAA GCAGCCACTT CTTTTAATAG GGGAAATGGA 18 0 

AGTCAGGCAG AACAACGTGG AGAACTCGAT TTAGAACGAG ATAAGGCAAT GAAAGCGGTC 24 0 

AGTGAATATG TAGGAAAAAT GGTGAGAGAT GCCTATGTAA AAT C AG AT AG AAAACGACAT 3 00 

AAAAATACTG TAGCTCTAGT TAACCAGTTG GGAAACATTA AGAACAGGTA TTTGAATGAA 3 60 

ATAGTTCATT CAACCTCAAA AAGC CAACTA CAGGAACTGA TGATGAAGAG TCAATCAGAA 42 0 

GTAGATGAAG CTGTGTCTAA ATTTGAAAAG GACTCATTTT CTTCGTCAAG TTCAGGATCC 480 

TCCACTAAAC CAGAAACTCC GCAGCCGGAA AATCCAGAGC ATCAAAAACC AACAACTCCA 54 0 

TCTCCGGATA CCAAAC CAAG CCCTCAACCA GAAGGCAAGA AACCAAGCGT ACCAGACATT 600 

AATCAGGAAA AAGAAAAAGC TAAGCTTGCT GTAGTAACCT ACATGAGCAA GATTTTAGAT 660 

GATATACAAA AACATCATCT GCAGAAAGAA AAACAT CGTC AGATTGTTGC TCTTATTAAG 72 0 

GAGCTTGATG AGCTTAAAAA GCAAGCTCTT TCTGAAATTG ATAATGTAAA TACCAAAGTA 78 0 

GAAATTGAAA ATACAGTCCA CAAGATATTT GCAGACATGG ATGCAGTTGT GACTAAATTC 84 0 

AAAAAAGGCT TAACTCAGGA CACACCAAAA GAACCAGGTA ACAAAAAACC ATCTGCTCCA 90 0 

AAACCAGGTA TGCAACCAAG TCCTCAACCA GAGGTTAAAC CGCAGCTGGA AAAACCAAAA 960 

CCAGAGGTTA AACCGCAACC AGAAAAACCA AAACCAGAGG TTAAACCGCA GCCGGAAAAA 102 0 

CCAAAACCAG AGGTTAAACC GCAGCCGGAA AAACCAAAAC CAGAGGTTAA ACCGCAGCCG 10 80 

GAAAAACCAA AAC CAGAGGT TAAACCGCAG CCGGAAAAAC CAAAACCAGA GGTTAAACCG 114 0 

CAGCCGGAAA AACCAAAACC AGAGGTTAAA CCGCAGCCGG AAAAACCAAA ACCAGAGGTT 12 00 

AAACCGCAGC CGGAAAAACC AAAACCAGAG GTTAAACCGC AGCCGGAAAA ACCAAAACCA 12 60 

GAGGTTAAAC CGCAGCCGGA AAAACCAAAA CCAGAGGTTA AACCGCAACC AGAAAAACCA 13 2 0 

AAACCAGAGG TTAAACCGCA ACCAGAAAAA CCAAAACCAG ATAATAGCAA GCCACAAGCA 13 80 

GATGATAAGA AGCCATCAAC TACAAATAAT TTAAGCAAGG ACAAGCAACC TTCTAACCAA 1440 

GCTTCAACAA ACGAAAAAGC AACAAATAAA CCGAAGAAGT CATTGCCATC AACTGGATCT 150 0 

ATTTCAAATC TAGCACTTGA AATTGCAGGT CTTCTTACCT TGGCGGGGGC AACCATTCTT 1560 

GCTAAGAAAA GAATGAAATA G (SEQ ID NO: 5) 1581 
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MENIDMFKSM 
AATSFNRGNG 
KNTVALVNQL 
DSFSSSSSGS 
NQEKEKAKLA 
SEIDNVNTKV 
KPGMQPSPQP 
KPKPEVKPQP 
KPQPEKPKPE 
PKPDNSKPQA 
ISNLALEIAG 



HERRMRYSIR 
SQAEQRGELD 
GNIKNRYLNE 
STKPETPQPE 
WTYMSKILD 
EIEMTVHKIF 
EVKPQLEKPK 
EKPKPEVKPQ 
VKPQPEKPKP 
DDKKPSTTNN 
LLTLAGATIL 



KFSVGVASVA 
LERDKAMKAV 
IVHSTSKSQL 
NPEHQKPTTP 
DIQKHHLQKE 
ADMDAWTKF 
PEVKPQPEKP 
PEKPKPEVKP 
EVKPQPEKPK 
LSKDKQPSNQ 
AKKRMK 



VASLFMGSW 
SEYVGKMVRD 
QELMMKSQSE 
SPDTKPSPQP 
KHRQIVALIK 
KKGLTQDTPK 
KPEVKPQPEK 
QPEKPKPEVK 
PEVKPQPEKP 
ASTNEKATNK 



HATEKEGSTQ 
AYVKSDRKRH 
VDEAVSKFEK 
EGKKPSVPDI 
ELDELKKQAL 
EPGNKKPSAP 
PKPEVKPQPE 
PQPEKPKPEV 
KPEVKPQPEK 
PKKSLPSTGS 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
526 



(SEQ ID NO: 6) 



FIGURE 6 



ATGAAATTTA GTAAAAAATA TATAGCAGCT GGATCAGCTG TTATCGTATC CTTGAGTCTA 
TGTGCCTATG CACTAAACCA GCATCGTTCG CAGGAAAATA AGGACAATAA TCGTGTCTCT 
TATGTGGATG GCAGCCAGTC AAGTCAGAAA AGTGAAAACT TGACACCAGA CCAGGTTAGC 
CAGAAAGAAG GAATTCAGGC TGAGCAAATT GTAATCAAAA TTACAGATCA GGGCTATGTA 
ACGTCACACG GTGACCACTA TCATTACTAT AATGGGAAAG TTCCTTATGA TGCCCTCTTT 
AGTGAAGAAC TCTTGATGAA GGATCCAAAC TATCAACTTA AAGACGCTGA TATTGTCAAT 
GAAGTCAAGG GTGGTTATAT CATCAAGGTC GATGGAAAAT ATTATGTCTA CCTGAAAGAT 
GCAGCTCATG CTGATAATGT TCGAACTAAA GATGAAATCA ATCGTCAAAA ACAAGAACAT 
GTCAAAGATA ATGAGAAGGT TAACTCTAAT GTTGCTGTAG CAAGGTCTCA GGGACGATAT 
ACGACAAATG ATGGTTATGT CTTTAATCCA GCTGATATTA TCGAAGATAC GGGTAATGCT 
TATATCGTTC CTCATGGAGG TCACTATCAC TACATTCCCA AAAGCGATTT ATCTGCTAGT 
GAATTAGCAG CAGCTAAAGC ACATCTGGCT GGAAAAAATA TGCAACCGAG TCAGTTAAGC 
TATTCTTCAA CAGCTAGTGA GAATAACACG CAATCTGTAG CAAAAGGATC AACTAGCAAG 
CCAGCAAATA AATCTGAAAA TCTCCAGAGT CTTTTGAAGG AACTCTATGA TTCACCTAGC 
GCCCAACGTT ACAGTGAATC AGATGGCCTG GTCTTTGACC CTGCTAAGAT TATCAGTCGT 
ACACCAAATG GAGTTGCGAT TCCGCATGGC GACCATTACC ACTTTATTCC TTACAGCAAG 
CTTTCTGCTT TAGAAGAAAA GATTGCCAGA ATGGTGCCTA TCAGTGGAAC TGGTTCTACA 
GTTTCTACAA ATGCAAAACC TAATGAAGTA GTGTCTAGTC TAGGCAGTGT TTCAAGCAAT 
CCTTCTTCTT TAACGACAAG TAAGGAGCTC TCTTCAGCAT CTGATGGTTA TATTTTTAAT 
C C AAAAG AT A TCGTTGAAGA AACGGCTACA GCTTATATTG TAAGACATGG TGATCATTTC 
CATTACATTC CAAAATCAAA TCAAATTGGG CAACCGACTC TTCCAAACAA TAGTCTAGCA 
ACACCTTCTC CATCTCTTCC AATCAATCCA GGAACTTCAC ATGAGAAACA TGAAGAAGAT 
GGATACGGAT TTGATGCTAA TCGTATTATC GCTGAAGATG AATCAGGTTT TGTCATGAGT 
CACGGAGACC ACAATCATTA TTTCTTCAAG AAGGACTTGA CAGAAGAGCA AATTAAGGTG 
CGCAAAAACA TTTAG (SEQ ID NO: 7) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1455 



FIGURE 7 



MKFSKKYIAA 
SENLTPDQVS 
SEELLMKDPN 
DEINRQKQEH 
YIVPHGGHYH 
QSVAKGSTSK 
TPNGVAIPHG 
VSSLGSLSSN 
HYIPKSNQIG 
AEDESGFVMS 



GSAVIVSLSL 
QKEGIQAEQI 
YQLKDADIVN 
VKDNEKVNSN 
YIPKSDLSAS 
PANKSENLQS 
DHYHFIPYSK 
PSSLTTSKEL 
QPTLPNNSLA 
HGDHNHYFFK 



CAYALNQHRS 
VIKITDQGYV 
EVKGGYIIKV 
VAVARSQGRY 
ELAAAKAHLA 
LLKELYDSPS 
LSALEEKIAR 
SSASDGYIFN 
TPSPSLPINP 
KDLTEEQIKV 



QENKDNNRVS YVDGSQSSQK 
TSHGDHYHYY NGKVPYDALF 
DGKYYVYLKD AAHADNVRTK 
TTNDGYVFNP ADIIEDTGNA 
GKNMQPSQLS YSSTASDNNT 
AQRYSESDGL VFDPAKIISR 
MVPISGTGST VSTNAKPNEV 
PKDIVEETAT AYIVRHGDHF 
GTSHEKHEED GYGFDANRII 
RKNI (SEQ ID NO: 8) 



50 
100 
150 
200 
250 
300 
350 
400 
450 
484 



FIGURE 8 



ATGAAAGATT TAGATAAAAA AATCGAAGAA AAAATTGCTG GCATTATGAA ACAATATGGT 
GTCAAACGTG AAAGTATTGT CGTGAATAAA GAAAAAAATG CGATTATTTA TCCGCATGGA 
GATCACCATC ATGCAGATCC GATTGATGAA CATAAACCGG TTGGAATTGG TCATTCTCAC 
AGTAACTATG AACTGTTTAA ACC CGAAGAA GGAGTTGCTA AAAAAGAAGG GAATAAAGTT 
TATACTGGAG AAGAATTAAC GAATGTTGTT AATTTGTTAA AAAATAGTAC GTTTAATAAT 
CAAAACTTTA CTCTAGCCAA TGGTCAAAAA CGCGTTTCTT TTAGTTTTCC GCCTGAATTG 
GAGAAAAAAT TAGGTATCAA TATGCTAGTA AAATTAATAA CACCAGATGG AAAAGTATTG 
GAGAAAGTAT CTGGTAAAGT ATTTGGAGAA GGAGTAGGGA ATATTGCAAA CTTTGAATTA 
GATCAACCTT ATTTACCAGG ACAAACATTT AAGTATACTA TCGCTTCAAA AGATTATCCA 
GAAGTAAGTT ATGATGGTAC ATTTACAGTT CCAACCTCTT TAGCTTACAA AATGGCCAGT 
CAAACGATTT TCTATCCTTT CCATGCAGGG GATACTTATT TAAGAGTGAA CCCTCAATTT 
GCAGTGCCTA AAGGAACTGA TGCTTTAGTC AGAGTGTTTG ATGAATTTCA TGGAAATGCT 
TATTTAGAAA ATAACTATAA AGTTGGTGAA ATCAAATTAC CGATTCCGAA ATTAAACCAA 
GGAACAACCA GAACGGCCGG AAATAAAATT CCTGTAACCT TCATGGCAAA TGCTTATTTG 
GACAATCAAT CGACTTATAT TGTGGAAGTA CCTATCTTGG AAAAAGAAAA TCAAACTGAT 
AAACCAAGTA TTCTACCACA ATTTAAAAGG AATAAAGCAC AAGAAAACTC AAAACTTGAT 
GAAAAGGTAG AAGAAC C AAA GACTAGTGAG AAGGTAGAAA AAGAAAAACT TTCTGAAACT 
GGGAATAGTA CTAGTAATTC AACGTTAGAA GAAGTTCCTA CAGTGGATCC TGTACAAGAA 
AAAGTAGCAA AATTTGCTGA AAGTTATGGG ATGAAGCTAG AAAATGTCTT GTTTAATATG 
GACGGAACAA TTGAATTATA TTTACCATCA GGAGAAGTCA TTAAAAAGAA TATGGCAGAT 
TTTACAGGAG AAGCACCTCA AGGAAATGGT GAAAATAAAC CATCTGAAAA TGGAAAAGTA 
TCTACTGGAA CAGTTGAGAA CCAACCAACA GAAAATAAAC CAGCAGATTC TTTACCAGAG 
GCACCAAACG AAAAACCTGT AAAACCAGAA AACTCAACGG ATAATGGAAT GTTGAATCCA 
GAAGGGAATG TGGGGAGTGA CCCTATGTTA GATCCAGCAT TAGAGGAAGC TCCAGCAGTA 
GATCCTGTAC AAGAAAAATT AGAAAAATTT ACAGCTAGTT ACGGATTAGG CTTAGATAGT 
GTTATATTCA ATATGGATGG AACGATTGAA TTAAGATTGC CAAGTGGAGA AGTGATAAAA 
AAGAATTTAT CTGATTTCAT AGCGTAA {SEQ ID NO: 9) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1587 



FIGURE 9 



MKDLDKKIEE KIAGIMKQYG 
HKPVGIGHSH SNYELFKPEE 
QNFTLANGQK RVSFSFPPEL 
GVGNIANFEL DQPYLPGQTF 
QTIFYPFHAG DTYLRWPQF 
IKLPIPKLNQ GTTRTAGNKI 
KPSILPQFKR NKAQENSKLD 
EVPTVDPVQE KVAKFAESYG 
FTGEAPQGNG ENKPSENGKV 
NSTDNGMLNP EGNVGSDPML 
VIFNMDGTIE LRLPSGEVIK 



VKRESIWNK EKNAIIYPHG 
GVAKKEGNKV YTGEELTNW 
EKKLGINMLV KLITPDGKVL 
KYTIASKDYP EVSYDGTFTV 
AVPKGTDALV RVFDEFHGNA 
PVTFMANAYL DNQSTYIVEV 
EKVEEPKTSE KVEKEKLSET 
MKLENVLFNM DGTIELYLPS 
STGTVENQPT ENKPADSLPE 
DPALEEAPAV DPVQEKLEKF 
KNLSDFIA (SEQ ID NO: 



T\TTTTTT7V T\ Tl T TMIP 

DHHHADPIDE 
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NLLKNSTFNjSI 
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1 tr n 


PTSLAYKMAb 


o n n 
Z U U 


YLEKNYKVGE 


2 50 


PILEKENQTD 


o a n 
jUU 


GNSTSNSTLE 


350 


GEVIKKNMAD 


400 


APNEKPVKPE 


450 


TASYGLGLDS 


500 
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528 



FIGURE 10 



BVH3 WU2 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 50 

BVH3 RX1 1 C^YALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 60 

BVH3 JNR7/87 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 60 

BVH3 SP64 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 60 

BVH3 P4241 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKSGIQAEQIVIKITDQGYV 60 

BVH3 A66 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 60 
*************************************************** 

BVH3 WU2 61 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIWEVKGGYIIKVDGKYYVYLKD 120 

BVH3 RX1 61 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDAD IVNEVKGGYI I KVDGKYYVYLKD 120 

BVH3 JNR7/87 61 T S HGDH YH YYNG KVP YD ALF S EE LLMKD PNYQLKDAD I VNE VKGG YI I KVD GKY YVYLKD 120 

BVH3 SP64 61 T S HGDH YHYYNGKVPYD ALF SE E LLMKD PNYQLKDAD I VNE VKGG YI I KVD G KYYVYLKD 120 

BVH3 P4241 61 TSHGDHYHYYNGKVPYDALFSEELLMKD PNYQLKDAD I VNE VKGG YI I KVDGKYYVYLKD 120 

BVH3 A66 61 TSHGDHYHY YNGKVP YDALFSEELLMKDPNYQLKDAD I VNE VKGGYI I KVDGKYYVYLKD 120 
************************************************************ 

BVH3 WU2 121 AAHADNVRTKDE INRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADI I EDTGNA 180 

BVH3 RX1 121 AAHADNVRTKDE INRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADI I EDTGNA 180 

BVH3 JNR7/87 121 AAHADNVRTKDEINRQKQEHVKDNEKWSNVAVARSQGRYTTNDGYVFNPADI I EDTGNA 180 

BVH3 SP64 121 AAHADNVRTKDE INRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADI I EDTGNA 180 

BVH3 P4241 121 AAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADI I EDTGNA 180 

BVH3 A66 121 AAHADNVRTKDE I NRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFN PAD 1 1 EDTGNA 180 
************************************************************ 

BVH3 WU2 181 YIVPHRGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 240 

BVH3 RX1 181 YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 240 

BVH3 JNR7/87 181 YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 240 

BVH3 SP64 181 YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 240 

BVH3 P4241 181 YIVPHRGHYHYIPKSDLSASEIJW^KAHl^AGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 240 

BVH3 A66 181 YIVPHRGHYHYIPKSDLSASELiAAAKAHLAGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 240 
***** ****************************************************** 

BVH3 WU2 241 P ANKS ENLQ S LLKE L YD S P S AQRY S E SDGLVFD P AKI I S RTPNG VAI PHGDH YH F I P YS K 300 

BVH3 RX1 241 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKI ISRTPNGVAI PHGDHYHFI PYSK 300 

BVH3 JNR7/87 241 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKI ISRTPNGVAI PHGDHYHFI PYSK 300 

BVH3 SP64 241 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKI ISRTPNGVAI PHGDHYHFI PYSK 300 

BVH3 P4241 241 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSK 300 

BVH3 A66 241 PANKS ENLQ S LLKE L YDS PSAQRYS E SDGLVFD PAKI ISRTPNGVAI PHGDHYHFI PYSK 300 
************************************* *********************** 

BVH3 WU2 301 LSALEEKIARMVPISGTGSTVSTNAKPNEWSSLGSLSSNPSSLTTSKELSSASDGYIFN 360 

BVH3 RX1 301 LS ALEE K I ARRVP I S GTG S T VS TNAKPNE WS S LG SLSSNPSS LTT S KE LS S AS DG Y I FN 360 

BVH3 JNR7/87 301 LSALEEKIARMVPISGTGSTVSTNAKPNEWSSLGSLSSNPSSLTTSKELSSASDGYIFN 360 

BVH3 SP64 301 LSALEEKIARMVPISGTGSTVSTNAKPNEWSSLGSLSSNPSSLTTSKELSSASDGYIFN 360 

BVH3 P4241 301 LSALEEKIARMVPISGTGSTVSTNAKPNEWSSLGSLSSNPSSLTTSKELSSASDGYIFN 360 

BVH3 A66 301 LSALEEKIARMVPISGTGSTVSTNAKPNEWSSLGSLSSNPSSLTTSKELSSASDGYIFN 360 
********** ************************************************* 

BVH3 WU2 361 PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 420 

BVH3 RX1 361 PKDIVEETATAYIVRHGDHFHYI PKSNQIGQPTLPNNSLATPSPSLPINPGI SHEKHEED 420 

BVH3 JNR7/87 361 PKDIVEETATAYIVRHGDHFHYI PKSNQ I GQPTLPNNSLATPS PS LP INPGTSHEKHEED 420 

BVH3 SP64 361 PKDIVEETATAYIVRHGDHFHYI PKSNQIGQPTLPNNSLATPSPSLP INPGTSHEKHEED 420 

BVH3 P4241 361 PKDIVEETATAYIVRHGDHFHYI PKSNQIGQPTLPNNSLATPSPSLP INPGTSHEKHEED 420 

BVH3 A66 361 PKDIVEETATAYIVRHGDHFHYI PKSNQ I GQPTLPNNSLATPS PS LP INPGTSHEKHEED 420 
*************************************************** ******** 

BVH3 WU2 421 G YG FD ANR 1 1 AEDE SG F VM S HGDHNH Y F F KKD LTE EQ I KAAQKHLE E VKTSHNG LD S L S S 480 

BVH3 RX1 421 GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 480 

BVH3 JNR7/87 421 GYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 480 

BVH3 SP64 421 GYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 480 

BVH3 P4241 421 GYGFDANRIIAEDESGFVNSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 480 

BVH3 A66 421 G YG FD ANRI I AEDESG F VMSHGDHNH YF F KKD LT E EQ I KAAQKHLE EVKT SHNGLDS L S S 480 
***************** ^**** ************************************* 



BVH3 WU2 



4 81 HEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHADPID 540 



BVH3 RX1 481 HEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHADPID 540 

BVH3 JNR7/87 4 81 HE QD Y P SNAKE MKD LD KKI E EKI AG I MKQ YG VKRE S I WNKE KNA 1 1 Y PHGDHHHAD P I D 54 0 

BVH3 SP64 481 HEQDYPGNAKEMKDLDKKIEEKI AGIMKQYGVKRES I WNKEKNAI I YPHGDHHHADPID 540 

BVH3 P4241 481 HEQDYPSNAKEMKDLDKKIEEKI AGIMKQYGVKRES I WNKEKNAI I YPHGDHHHADPID 540 

BVH3 A66 481 HEQDYPSNAKEMKDLDKKIEEKI AG I MKQYGVKRES I WNKEKNAI I YPHGDHHHADPID 540 
****** ***************************************************** 

BVH3 WU2 541 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNWNLLKNSTFNNQNFTLANGQ 600 

BVH3 RX1 541 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKYYTGEELTNWNLLKNSTFNNQNFTLANGQ 600 

BVH3 JNR7/87 541 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNWNLLKNSTFNNQNFTLANGQ 600 

BVH3 SP64 541 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNWNLLKNSTFNNQNFTliANGQ 600 

BVH3 P4241 541 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNVVNLLKNSTFNNQNFTLANGQ 600 

BVH3 A66 541 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNWNLLK1JSTFNNQNFTLANGQ 600 
************************************************************ 

BVH3 WU2 601 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 660 

BVH3 RX1 601 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 660 

BVH3 JNR7/87 601 KRVS F S F P P ELE KKLG INMLVKL I T P DGKVLE KVSGKVFGE G VGN I ANF E LDQ P YL PGQT 660 

BVH3 SP64 601 KRVS F S F P P E LE KKLG I NMLVKL I T PDGKVLE KVSGKVFGE G VGN I ANF E LDQ PYL PGQT 660 

BVH3 P4241 601 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 660 

BVH3 A66 601 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 660 
************************************************************ 



BVH3 WU2 
BVH3 RX1 



661 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 720 

BVH3 RX1 661 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 720 

BVH3 JNR7/87 661 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 720 

BVH3 SP64 661 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 720 

BVH3 P4241 661 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 720 

BVH3 A66 661 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 720 
************************************************************ 

BVH3 WU2 721 VRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNKIPVTFMANAYLDNQSTYIVE 780 

BVH3 RX1 721 VRVFDEFHGNAYLENNYKVGEI KLPI PKLNQGTTRTAGNKI PVTFMANAYLDNQSTYI VE 780 

BVH3 JNR7/87 721 VRVFDEFHGNAYLENNYKVGEI KLPI PKLNQGTTRTAGNKI PVTFMANAYLDNQSTYI VE 780 

BVH3 SP64 721 VRVFDEFHGNAYLENNYKVGEI KLPI PKLNQGTTRTAGNKI PVTFMANAYLDNQSTYI VE 780 

BVH3 P4241 721 VRVFDEFHGNAYLENNYKVGE I KLPI PKLNQGTTRTAGNKI PVTFMANAYLDNQSTYI VE 780 

BVH3 A66 721 VRVFDEFHGNAYLENNYKVGE IKLP I PKLNQGTTRTAGNKI PVTFMANAYLDNQSTYIVE 780 
************************************************************ 

BVH3 WU2 781 VPILEKENQTDKPSILPQFKRNKAQENSKFDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 840 

BVH3 RX1 781 VPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 840 

BVH3 JNR7/87 781 VPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 840 

BVH3 SP64 781 VPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 840 

BVH3 P4241 781 VPILEKENQTDKPSILPQFKRNKAQENSKFDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 840 

BVH3 A66 781 VPILEKENQTDKPSILPQFKRNKAQENSKFDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 840 
*************************** * ****************************** 

BVH3 WU2 841 E E VPT VD P VQE KVAKF AE S YGMKLENVLFNMDGT IELYLPSGEVI KKNMAD F TGE APQGN 900 

BVH3 RX1 841 EEVPTVD PVQEKVAKFAE S YGMKLENVLFNMDGT I ELYLPSGE VI KKNMADFTGE APQGN 900 

BVH3 JNR7/87 841 EE VPT VDP VQE KVAKF AES YGMKLENVLFNMDGT I ELYLPSGE VI KKNMADFTGE APQGN 900 

BVH3 SP64 841 EE VPTVDPVQEKVAKF AES YGMKLENVLFNMDGT I ELYLPSGEVIKKNMADFTGEAPQGN 900 

BVH3 P4241 841 EEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGT I ELYLPSGEVIKKNMADFTGEAPQGN 900 

BVH3 A66 841 EEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGT I ELYLPSGEVIKKNMADFTGEAPQGN 900 
************************************************************ 

BVH3 WU2 901 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 960 

BVH3 RX1 901 GENKP S ENGKVS T GT VENQ PTENKPAD S L P E APNE KP VKP ENS TDNGMLNP EGNVGS D PM 960 

BVH3 JNR7/87 901 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 960 

BVH3 SP64 901 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 960 

BVH3 P4241 901 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 960 

BVH3 A66 901 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 960 
************************************************************ 



BVH3 WU2 
BVH3 RX1 



961 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVI FNMDGTIELRLPSGEVIKKNLSDLIA 1019 

BVH3 RX1 961 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDLIA 1019 

BVH3 JNR7/87 961 LD P ALE E AP AVD P VQE KLEKF TAS YGLGLD S VI FNMDGT IELRLPSG E VI KKNL S DL I A 1019 

BVH3 SP64 961 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDFIA 1019 

BVH3 P4241 961 LDPALEEAPAVDPVQEKLEKFTAS YGLGLD S VI FNMDGTIELRLPSGEVIKKNLSDLIA 1019 

BVH3 A66 961 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVI FNMDGTIELRLPSGEVIKKNLSDLIA 1019 

******************************************************** ** 



FIGURE 11 



BVH11-2 SP64 1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11-2 JNR7/87 1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11-2 P4241 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11-2 A66 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11-2 WU2 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11-2 Rxl 1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11 P4241 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11 WU2 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11 A66 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11 Rxl 1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11 JNR7/87 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11 SP63 1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11 SP64 1 CAYELGLHQA-QTVKENNRVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQGY 59 
* t **** *** * ** ***.**** ** ** . ******************* ********* 



BVH11-2 SP64 61 VT S HGDHYH YYNGKVP YDA USEE LLMKD PNYQLKD S D I VNE I KGG YVI KVDGKY YVYL K 120 

BVH11-2 JNR7/87 61 WSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIWEIKGGYVIKVDGKYYVYLK 120 

BVH11-2 P4241 61 WSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIWEIKGGWIKVNGKYYVYLK 120 

BVHU-2 A66 61 VTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGYVIKVNGKYYVYLK 120 

BVH11-2 WU2 61 VTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGYVIKVNGKYYVYLK 120 

BVH11-2 Rxl 61 VTS HGDHYH YYNGKVP YDA 1 1 SEELLMKDPNYQLKDSD I VNE I KGGYVI KVDGKY YVYLK 120 

BVH11 P4241 61 VTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIWEIKGGYVIKWGKYYVYLK 120 

BVH11 WU2 61 VTSHGDHYHYYNGKVPYDAI I SEE LLMKD PNYQLKD SD I VNE I KGG YVI KVNGKYYGYLK 120 

BVH11 A66 61 VTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSD I WE I KGG YVI KVNGKYYVYLK 120 

BVH11 Rxl 61 VT S HGDH YHY YNG KVP YDAI I S E E LLMKD PNYQLKD SD I VNE I KGG YV I KVDGKYTVYLK 120 

BVH11 JNR7/87 61 VTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSD I VNE I KGGYVI KVNGKYYVYLK 120 

BVH11 SP63 61 VTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIWEIKGGYVIKVDGKYYVYLK 120 

BVH11 SP64 60 VTSHGDHYHYYNGKVPYDAIISEELl^KDPNYQLKDSDIWEIKGGYVIKVNGKYYVYLK 119 
*************************************************** **** *** 
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121 DAAHADNIRTKEE I KRQKQEHSHNHNSRA DNAVAAARAQGRYTTDDGYI FNASDI I E 177 

121 DAAHADNIRTKEE I KRQKQEHSHNHGGGSN- - DQAWAARAQGRYTTDDGYI FNASDI I E 178 

121 DAAHADNIRTKEE I KRQKQEHSHNHGGGSN- -DQAWAARAQGRYTTDDGYI FNASDI IE 178 

121 DAAHADNIRTKEEIKRQRQEHSHNHGGGSN- - DQAWAARAQGRYTTDDGYI FNASDI IE 178 

121 DAAHADNIRTKEE I KRQKQEHSHNHGGGSN - - DQAWAARAQGRYTTDDGYI FNASDI IE 178 

121 DAAHADNIRTKEE I KRQKQERSHNHNSRA DNAVAAARAQGRYTTDDGYI FNASD I I E 177 

121 DAAHADNIRTKEE I KRQKQEHSHNHGGGSN - - DQAWAARAQGRYTTDDGYI FNASDI IE 178 

121 DAAHADNIRTKEE I KRQKQEHSHNHGGGSN - - DQAWAARAQGRYTTDDGYI FNASDI IE 178 

121 DAAHADNIRTKEE IKRQKQEHSHNHGGGSN- -DQAWAARAQGRYTTDDGYI FNASDI IE 178 

121 DAAHADNIRTKEE I KRQKQERSHNHNSRA DNAVAAARAQGRYTTDDGYI FNASDI I E 177 

121 DAAHADNIRTKEE I KRQKQERSHNHNSRA DNAVAAARAQGRYTTDDGYI FNASDI I E 177 

121 DAAHADNIRTKEE I KRQKQERSHNHNSRA DNAVAAARAQGRYTTDDGYI FNASDI IE 177 

120 DAAHADNVRTKEEINRQKQEHSQHREGGTSANDGAVAFARSQGRYTTDDGYI FNASDI I E 179 



******* ****** ** ** , 



** ******************* 
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178 DTGDAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPVQPRLSEN 237 

179 DTGD AY I V PHGDH YH Y I P KNE L S AS ELAAAE AYWNG KQGSRPSSSSS YNANP AQPRL S EN 238 
179 DTGDAYIVPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 
179 DTGDAYIVPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 
179 DTGDAYIVPRGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 

178 DTGDAYIVPHGDHYHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 237 

179 DTGDAYIVPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 
179 DTGDAYIVPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 

179 DTGDAYIVPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 
178 DTGDAYIVPHGDHYHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 237 
178 DTGDAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPAQPRLSEN 237 
178 DTGDAYIVPHGNHFHYI P KS D LS AS ELAAAQAYWNGKQG S R P S S S SSHNANPAQPRLSEN 23 7 

180 DTGDAYIVPHGDHYHYI PKNELSASELAAAEAFLSGRENLSNLRTYRRQNSDNTPRTNWV 239 
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238 HNLTVTPTYHQN QGENI S S LLRE LYAKP LS ERH VE SDGL I FD P AQ ITS 285 

239 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 286 

239 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVE SDGL I FDPAQITS 286 

239 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 286 

239 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERRVESDGLI FDPAQITS 286 

23 8 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 285 

239 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 286 

239 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 286 

239 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 286 

238 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 285 

238 HNLTVTPTYHQN QGENISSLLRELYAKPLSERHVESDGLIFDPAQITS 285 

238 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGL I FDPAQITS 285 

240 PSVSNPGTTNTNTSNNSNTNSQASQSND I DSLLKQLYKLPLSQRHVESDGLI FDPAQITS 299 

* * * * *** ** *** * *************** 
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286 RTARGVAVPHGNHYHF I PYEQMSELEKRIARI I PLRYRSNHWVPDSRPEQPS PQSTPEPS 345 
87 287 RTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPS 346 

2 87 RTARGVAVPHGNHYHF I PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPS PQ PS 342 

2 87 RTARGVAVPHGNHYHF I PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPS PQ PS 342 

287 RTARGVAVPHGNHYHF I PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPS PQ PS 342 

286 RTANGVAVPHGDHYHF I PYSQLSPLEEKLARI I PLRYRSNHWVPDSRPEQPS PQSTPEPS 345 

287 RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQ PS PQ PS 342 

287 RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ PS 342 

287 RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ PS 342 

286 RTANGVAVPHGDHYHFIPYSQLSPLEEKLARIIPLRYRSNHWVPDSRPEQPSPQSTPEPS 345 
286 RTARGVAVPHGNHYHF I PYEQMSELEKRIARI I PLRYRSNHWVPDSRPEEPSPQPT PEPS 345 
286 RTARGVAVPHGNHYHF I PYSQMSELEERIARI I PLRYRSNHWVPDSRPEQPS PQSTPEPS 345 
300 RTARGVAVPHGNHYHF I PYEQMSELEKRIARI I PLRYRSNHWVPDSRPEEPSPQPTPEPS 359 



*** ******* ******* *. 



******************** . **** 



BVH11-2 SP64 346 PS LQ P APNPQ P AP SN P I DE KLVKE AVRKVGD G YVF EENGVS RY I PAKD L S AET AAG I D S K 405 

BVH11-2 JNR7/87 347 PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSK 406 

BVH11-2 P4241 343 PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSK 402 

BVH11-2 A66 343 PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSK 402 

BVH11-2 WU2 343 PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSK 402 

BVH11-2 Rxl 346 PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVPRYIPAKDLSAETAAGIDSK 405 

BVH11 P4241 343 PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSK 402 

BVH11 WU2 343 PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSK 402 

BVH11 A66 343 PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSK 402 

BVH11 Rxl 346 PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVPRYIPAKDLSAETAAGIDSK 405 

BVH11 JNR7/87 346 PSP QPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSK 399 

BVH11 SP63 346 PSPQSAPNPQPAPSNPIDEKLVKEWRKVGDGYVFEKNGVSRYIPAKNLSAETAAGIDSK 405 

BVH11 SP64 360 PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKNLSAETAAGIDSK 419 
** *************** *********** *** ****** ************ 
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4 06 LAKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEVLDNLLERL 465 
407 LAKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERL 466 
4 03 LAKQESLSHKLGTKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERL 4 62 
403 LAKQESLSHKLGTKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERL 462 
403 LAKQE S LSHKLGTKKTDLP S S DRE F YNKAYD LLAR I HQDL LDNKGRQVD F E ALDNLLE RL 462 
406 LAKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERL 465 
403 LAKQESLSHKLGTKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERL 462 
403 LAKQE SLSHKLGTKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERL 462 
403 LAKQESLSHKLGTKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERL 462 
406 LAKQE S LS HKLGAKKTDLP S SDRE F YNKAYDLLAR I HQD LLDNKGRQ VD F E ALDNLLE RL 4 65 
4 00 LAKQE SLSHKLGAKKTDLPS SDRE FYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERL 459 
4 06 LAKQE SLSHKLGAKKTDLPS SDRE FYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERL 465 
42 0 LAKQ E S LSHKLG AKKTD L P S SDRE F YNKAYDLLAR I HQDLLDNKGRQ VD F E ALDNL LERL 479 



BVH11-2 SP64 466 KDVSSDKVKLVDDILiAF]^APIRHPERLGKPNAQITYTDDE IQVAKLAGKYTTEDGYIFDP 525 

BVH11-2 JNR7/87 467 KDVPSDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDP 526 

BVH11-2 P4241 463 KDVSSDKVKLVEDILAFLAPIRHPERLGKPNSQITYTDDEIQVAKLAGKYTTEDGYIFDP 522 

BVH11-2 A66 463 KDVSSDKVKLVEDILAFLAPIRHPERLGKPNSQITYTDDEIQVAKLAGKYTTEDGYIFDP 522 

BVH11-2 WU2 463 KDVSSDKVKLVEDILAFLAPIRHPERLGKPNSQITYTDDEIQVAKLAGKYTTEDGYIFDP 522 

BVH11-2 Rxl 466 KDVSSDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDP 525 

BVH11 P4241 463 KDVSSDKVKLVEDILAFLAPIRHPERLGKPNSQITYTDDEIQVAKLAGKYTTEDGYIFDP 522 

BVH11 WU2 463 KDVS S D KVKL VED I LAFLAP I RH PERLG K PNS Q I T YTDDE I Q VAKLAGKYTTE DG Y I FD P 522 

BVH11 A66 463 KDVSSDKVKLVEDILAFLAPIRHPERLGKPNSQITYTDDEIQVAKLAGKYTTEDGYIFDP 522 

BVH11 Rxl 466 KDVSSDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAICLAGKYTTEDGYIFDP 525 

BVH11 JNR7/87 460 KDVS SDKVKLVDDI LAFLAP I RHPERLGKPNAQ IT YTDDE IQVAKLAGKYTTEDGYIFDP 519 

BVH11 SP63 466 EDVP SDKVKLVDDI LAFLAP I RHPERLGKPNAQ IT YTDDE IQVAKLAGKYTTEDGYIFDP 525 

BVH11 SP64 4 80 KDVS SDKVKLVDDI LAFLAP I RHPERLGKPNAQ IT YTDDE IQVAKLAGKYTTEDGYIFDP 539 
** ******* # ******************* **************************** 

BVH11-2 SP64 526 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 5 85 

BVH11-2 JNR7/87 527 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 586 

BVH11-2 P4241 523 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHRDSGNTEAK 582 

BVH11-2 A66 523 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 582 

BVH11-2 WU2 523 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 582 

BVH11-2 Rxl 526 RD I T S D EGD AYVTPHMTH SHW I KKD S L S E AERAAAQ AYAKE KG LT P P S TDHQD SGNT E AK 585 

BVH11 P4241 523 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 582 

BVH11 WU2 523 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 5 82 

BVH11 A66 523 RD I T S D EGD AYVTPHMTH SHWI KKD S L S E AERAAAQAYAKE KGLT P P S TDHQD S GNTE AK 582 

BVH11 Rxl 526 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 585 

BVH11 JNR7/87 520 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 579 

BVH11 SP63 526 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 585 

BVH11 SP64 54 0 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 599 
*********************************************************** 



BVH11-2 SP64 586 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGS LI I PHYDHYHNIKFEWFDEGLYEAPK 645 

BVH11-2 JNR7/87 587 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI I PHYDHYHNIKFEWFDEGLYEAPK 646 

BVH11-2 P4241 583 GAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGS LI I PHYDHYHNIKFEWFDEGLYEAPK 642 

BVH11-2 A66 5 83 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGS LI I PHYDHYHNIKFEWFDEGLYEAPK 642 

BVH11-2 WU2 583 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGS L 1 1 PHYDHYHNIKFEWFDEGLYEAPK 642 

BVH11-2 Rxl 586 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGS LI I PHYDHYHNIKFEWFDEGLYEAPK 645 

BVH11 P4241 583 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI I PHYDHYHNIKFEWFDEGLYEAPK 642 

BVH11 WU2 583 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGS LI I PHYDHYHNIKFEWFDEGLYEAPK 642 

BVH11 A66 583 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEWFDEGLYEAPK 642 

BVH11 Rxl 586 GAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEWFDEGLYEAPK 645 

BVH11 JNR7/8 7 58 0 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGS LI I PHYDHYHNIKFEWFDEGLYEAPK 639 

BVH11 SP63 586 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGS LI I PHYDHYHNIKFEWFDEGLYEAPK 645 

BVH11 SP64 600 GAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSL I I PHYDHYHNIKFEWFDEGLYEAPK 659 
*************************************************** ********* 
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646 GYSLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNK ADQDSK 690 

647 GYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNK VDQDSK 691 

643 GYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNK ADQDSK 687 

643 GYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNK ADQDSK 687 

643 GYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNK ADQDSK 687 

646 GYSLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTEKPNEEKPQTEK 705 

643 GYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNK ADQDSK 687 

643 G YTL EDL LAT VKYYVEH PNERPHSDNG FGNASDH VRKNK ADQDSK 687 

643 G YTLED LLATVKYYVEH PNE RPHSDNG FGNASDHVRKNK ADQDSK 687 

646 GYS LED LLATVKYYVEH PNE RPHSDNGFGNASDHVQRNK NGQ 687 

640 GYS LE D LLATVKYYVEH PNE R PHSDNGFGNASDH VQRN K NGQ 681 

64 6 GYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNK NGQ 687 

660 GYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNK NGQ 701 



BVH11-2 SP64 691 PDEDKEHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQV 750 
BVH11-2 JNR7/87 692 PDEDKEHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQV 751 

BVH11-2 P4241 688 PDED KGHDE VS E PTHPE SDE KENHAGLN P S ADNLYKP STDTE ETE E E AEDTTD E AE I PQ V 747 

BVH11-2 A66 688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQV 747 

BVH11-2 WU2 688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQV 747 

BVH11-2 Rxl 706 PEEDKEHDEVSEPTHPESDEKENHVGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQV 765 

BVH11 P4241 688 PDEDKGHDEVSEPTHPESDEKENHAG LNPSADNLYKPSTDTEETEEE AEDTTD EAE I PQV 747 

BVH11 WU2 688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQV 747 

BVH11 A66 688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQV 74 7 

BVH11 Rxl 688 ADTNQTEKPNEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEEESPEESPEESEEPQV 747 

BVH11 JNR7/87 682 ADTNQTEKPNEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEEESPEESPEESEEPQV 741 

BVH11 SP63 688 ADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESP KPTEEPEEESPEESEEPQV 743 

BVH11 SP64 702 ADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESP KPTEEPEEESPEESEEPQV 757 

*.***, * * ** *.. .*.* *** 

BVH11-2 SP64 751 ENSVINAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTISAEVDSLLALL 810 

BVH11-2 JNR7/87 752 ENSVINAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTISAEVDSLLALL 811 

BVH11-2 P4241 748 EHSVINAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTISAEVDSLLALL 807 

BVH11-2 A66 748 EHSVINAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTISAEVDSLLALL 807 

BVH11-2 WU2 748 EHSVINAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTISAEVDSLLALL 807 

BVH11-2 Rxl 766 EYSVINAKIAEAEALLEICVTDSSIRQNAVETLTGLKSSLLLGTKDNNTISAEVDSLLALL 825 

BVH11 P4241 748 EHSVINAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTISAEVDSLLALL 807 

BVH11 WU2 74 8 EHSVINAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTISAEVDSLLALL 8 07 

BVH11 A66 748 EHSVINAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTISAEVDSLLALL 807 

BVH11 Rxl 748 E TE KVKEKLRE AEDL LGK I QNP 1 1 KSNAKETLTGL KNWLLFGTQDNNT I MAE AE KL LAL L 807 

BVH11 JNR7/87 742 ETEKVKEKLREAEDLLGKIQNPIIKSNAKETLTGLKNNLLFGTQDNNTIMAEAEKLLALL 801 

BVH11 SP63 744 ETEKVEEKLREAEDLLGKIQDPIIKSNAKETLTGLKNNLLFGTQDNNTIMAEAEKLLALL 803 

BVH11 SP64 758 ETEKVEEKLREAEDLLGKIQDPIIKSNAKETLTGLKWNLLFGTQDNNTIMAEAEKLIiALL 817 
* , *_ t ** ** * t *_ ** ******* ** ******* ** _ ***** 
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AATTCCTTGT CGGGTAAGTT CCGACCCGCA CGAAAGGCGT AATGATTTGG GCACTGTCTC 60 

AACGAGAGAC TCGGTGAAAT TTTAGTACCT GTGAAGATGC AGGTTACCCG CGACAGGACG 12 0 

GAAAGACCCC ATGGAGCTTT ACTGCAGTTT GATATTGAGT GTCTGTACCA CATGTACAGG 18 0 

ATAGGTAGGA GTCTAAGAGA TCGGGACGCC AGTTTCGAAG GAGACGCTGT TGGGATACTA 240 

CCCTTGTGTT ATGGCCACTC TAACCCAGAT AGGTGATCCC TATCGGAGAC AGTGTCTGAC 300 

GGGCAGTTTG ACTGGGGCGG TCGCCTCCTA AAAGGTAACG GAGGCGCCCA AAGGTTCCCT 3 60 

CAGAATGGTT GGAAATCATT CGCAGAGTGT AAAGGTATAA GGGAGCTTGA CTGCGAGAGC 42 0 

TACAACTCGA GCAGGGACGA AAGTCGGGCT TAGTGATCCG GTGGTTCCGT ATGGAAGGGC 480 

CATCGCTCAA CGGATAAAAG CTACCCTGGG GATAACAGGC TTATCTCCCC CAAGAGTTCA 54 0 

CATCGACGGG GAGGTTTGGC ACCTCGATGT CGGCTCGTCG CATCCTGGGG CTGTAGTCGG 6 00 

TCCCAAGGGT TGGGCTGTTC GCCCATTAAA GCGGCACGCG AGCTGGGTTC AGAACGTCGT 660 

GAGACAGTTC GGTCCCTATC CGTCGCGGGC GTAGGAAATT TGAGAGGATC TGCTCCTAGT 72 0 

ACGAGAGGAC CAGAGTGGAC TTACCGCTGG TGTACCAGTT GTCTTGCCAA AGGCATCGCT 78 0 

GGGTAGCTAT GTAGGGAAGG GATAAACGCT GAAAGCATCT AAGTGTGAAA CCCACCTCAA 84 0 

GATGAGATTT CCCATGATTA TATATCAGTA AGAGCCCTGA GAGATGATCA GGTAGATAGG 900 

TTAGAAGTGG AAGTGTGGCG ACACATGTAG CGGACTAATA CTAATAGCTC GAGGACTTAT 960 

CCAAAGTAAC TGAGAATATG AAAGCGAACG GTTTTCTTAA ATTGAATAGA TATTCAATTT 102 0 

TGAGTAGGTA TTACTCAGAG TTAAGTGACG ATAGCCTAGG AGATACACCT GTACCCATGC 10 8 0 

CGAACACAGA AGTTAAGCCC TAGAACGCCG GAAGTAGTTG GGGGTTGCCC CCTGTGAGAT 114 0 

AGGGAAGTCG CTTAGCTCTA GGGAGTTTAG CTCAGCTGGG AGAGCATCTG CCTTACAAGC 12 00 

AGAGGGTCAG CGGTTCGATC CCGTTAACTC CCAAAGGTCC CGTAGTGTAG CGGTTATCAC 12 60 

GTCGCCCTGT CACGGCGAAG ATCGCGGGTT CGATTCCCGT CGGGACCGTT TAAGGTAACG 132 0 

CAAGTTATTT TAGACTCGTT AGCTCAGTTG GTAGAGCAAT TGACTTTTAA TCAATGGGTC 13 8 0 

ACTGGTTCGA GCCCAGTACG GGTCATATAT GCGGGTTTGG CGGAATTCTA ATCTCTTTGA 144 0 

AATCATCTTC TCTCACTTTC CAAAACTCTA TTACCTCTTA TTATAC C AC A TTTCAATCTT 1500 

CAACTTCCCA GTAATATAAG CACCTCTGGC GAAAGAAGTT TCAATGTCCT AAAGTAATAA 1560 

GTGAATCCAA TTCAGGAACT CCAAGAACAA AAGAAACATC TGGTGTCACA AGTATTGGAT 162 0 

GGCACAGAGT CACGTGGTAG TCTGACCCTA GCAGAAATTT TAAATAGTAA ACTATTTACT 168 0 

GGTTAATTAA ATGGTTAAAT AACCGGTTTA GAAAACTATT TAATAAAGTA AAAGAAGTTG 174 0 

AGAAAAAACT TCATCATTTA TTGAAATGAG GGATTTATGA AATTTAGTAA AAAATATATA 18 0 0 

GCAGCTGGAT CAGCTGTTAT CGTATCCTTG AGTCTATGTG CCTATGCACT AAACCAGCAT 1860 

CGTTCGCAGG AAAATAAGGA CAATAATCGT GTCTCTTATG TGGATGGCAG CCAGTCAAGT 192 0 

CAGAAAAGTG AAAACTTGAC ACCAGACCAG GTTAGCCAGA AAGAAGGAAT TCAGGCTGAG 198 0 

CAAATTGTAA TCAAAATTAC AGATCAGGGC TATGTAACGT CACACGGTGA CCACTATCAT 2 04 0 

TACTATAATG GGAAAGTTCC TTATGATGCC CTCTTTAGTG AAGAACTCTT GATGAAGGAT 2100 

CCAAACTATC AACTTAAAGA CGCTGATATT GTCAATGAAG TCAAGGGTGG TTATATCATC 2160 

AAGGTCGATG GAAAATATTA TGTCTACCTG AAAGATGCAG CTCATGCTGA TAATGTTCGA 222 0 

ACTAAAGATG AAATCAATCG TCAAAAACAA GAACATGTCA AAGATAATGA GAAGGTTAAC 22 80 

TCTAATGTTG CTGTAGCAAG GTCTCAGGGA CGATATACGA CAAATGATGG TTATGTCTTT 234 0 

AATCCAGCTG ATATTATCGA AGATACGGGT AATGCTTATA TCGTTCCTCA TGGAGGTCAC 240 0 

TATCACTACA TTCCCAAAAG CGATTTATCT GCTAGTGAAT TAGCAGCAGC TAAAGCACAT 2460 

CTGGCTGGAA AAAATATGCA ACCGAGTCAG TTAAGCTATT CTTCAACAGC TAGTGACAAT 2 52 0 

AACACGCAAT CTGTAGCAAA AGGATCAACT AGCAAGCCAG CAAATAAATC TGAAAATCTC 2 5 80 

CAGAGTCTTT TGAAGGAACT CTATGATTCA CCTAGCGCCC AACGTTACAG TGAATCAGAT 2 64 0 

GGCCTGGTCT TTGACCCTGC TAAGATTATC AGTCGTACAC CAAATGGAGT TGCGATTCCG 2 700 

CATGGCGACC ATTACCACTT TATTCCTTAC AGCAAGCTTT CTGCTTTAGA AGAAAAGATT 2760 

GCCAGAATGG TGCCTATCAG TGGAACTGGT TCTACAGTTT CTACAAATGC AAAAC CTAAT 2 82 0 

GAAGTAGTGT CTAGTCTAGG CAGTCTTTCA AGCAATCCTT CTTCTTTAAC GACAAGTAAG 2 8 80 

GAGCTCTCTT CAGCATCTGA TGGTTATATT TTTAATCCAA AAGATATCGT TGAAGAAACG 2 940 

GCTACAGCTT ATATTGTAAG ACATGGTGAT CATTTCCATT AC ATT C C AAA ATCAAATCAA 3 0 00 

ATTGGGCAAC CGACTCTTCC AAACAATAGT CTAGCAACAC CTTCTCCATC TCTTCCAATC 3 060 

AATCCAGGAA CTTCACATGA GAAACATGAA GAAGATGGAT ACGGATTTGA TGCTAATCGT 3120 

ATTATCGCTG AAGATGAATC AGGTTTTGTC ATGAGTCACG GAGACCACAA TCATTATTTC 318 0 

TTCAAGAAGG ACTTGACAGA AGAGCAAATT AAGGCTGCGC AAAAACATTT AGAGGAAGTT 3240 

AAAAC TAGTC ATAATGGATT AGATTCTTTG TCATCTCATG AACAGGATTA TCCAGGTAAT 33 00 

GCCAAAGAAA TGAAAGATTT AGATAAAAAA ATCGAAGAAA AAATTGCTGG CATTATGAAA 33 60 



CAATATGGTG TCAAACGTGA AAGTATTGTC GTGAATAAAG AAAAAAATGC GATTATTTAT 342 0 

CCGCATGGAG ATCACCATCA TGCAGATCCG ATTGATGAAC ATAAACCGGT TGGAATTGGT 34 8 0 

CATTCTCACA GTAACTATGA ACTGTTTAAA CCCGAAGAAG GAGTTGCTAA AAAAGAAGGG 354 0 

AATAAAGTTT ATACTGGAGA AGAATTAACG AATGTTGTTA ATTTGTTAAA AAATAGTACG 3 600 

TTTAATAATC AAAACTTTAC TCTAGCCAAT GGTCAAAAAC GCGTTTCTTT TAGTTTTCCG 3 66 0 

CCTGAATTGG AGAAAAAATT AGGTATCAAT ATGCTAGTAA AATTAATAAC ACCAGATGGA 3 72 0 

AAAGTATTGG AGAAAGTATC TGGTAAAGTA TTTGGAGAAG GAGTAGGGAA TATTGCAAAC 3 780 

TTTGAATTAG ATCAACCTTA TTTACCAGGA CAAACATTTA AGTATACTAT CGCTTCAAAA 3 84 0 

GATTATCCAG AAGTAAGTTA TGATGGTACA TTTACAGTTC CAACCTCTTT AGCTTACAAA 3 900 

ATGGCCAGTC AAACGATTTT CTATCCTTTC CATGCAGGGG ATACTTATTT AAGAGTGAAC 3 960 

CCTCAATTTG CAGTGCCTAA AGGAACTGAT GCTTTAGTCA GAGTGTTTGA TGAATTTCAT 4 02 0 

GGAAATGCTT ATTTAGAAAA TAACTATAAA GTTGGTGAAA TCAAATTACC GATTCCGAAA 40 80 

TTAAACCAAG GAACAACCAG AACGGCCGGA AATAAAATTC CTGTAACCTT CATGGCAAAT 414 0 

GCTTATTTGG ACAATCAATC GACTTATATT GTGGAAGTAC CTATCTTGGA AAAAGAAAAT 42 0 0 

CAAACTGATA AACCAAGTAT TCTACCACAA TTTAAAAGGA ATAAAGCACA AGAAAACTCA 42 6 0 

AAACTTGATG AAAAGGTAGA AGAACCAAAG ACTAGTGAGA AGGTAGAAAA AGAAAAACTT 43 2 0 

TCTGAAACTG GGAATAGTAC TAGTAATTCA ACGTTAGAAG AAGTTCCTAC AGTGGATCCT 43 80 

GTACAAGAAA AAGTAGCAAA ATTTGCTGAA AGTTATGGGA TGAAGCTAGA AAATGTCTTG 4440 

TTTAATATGG ACGGAACAAT TGAATTATAT TTAC CATCAG GAGAAGTCAT TAAAAAGAAT 4500 

ATGGCAGATT TTACAGGAGA AGCACCTCAA GGAAATGGTG AAAATAAACC ATCTGAAAAT 4560 

GGAAAAGTAT CTACTGGAAC AGTTGAGAAC CAACCAACAG AAAATAAACC AGCAGATTCT 462 0 

TTACCAGAGG CACCAAACGA AAAACCTGTA AAACCAGAAA ACTCAACGGA TAATGGAATG 468 0 

TTGAATCCAG AAGGGAATGT GGGGAGTGAC CCTATGTTAG ATCCAGCATT AGAGGAAGCT 4 74 0 

CCAGCAGTAG ATCCTGTACA AGAAAAATTA GAAAAATTTA CAGCTAGTTA CGGATTAGGC 4 800 

TTAGATAGTG TTATATTCAA TATGGATGGA ACGATTGAAT TAAGATTGCC AAGTGGAGAA 4 860 

GTGATAAAAA AGAATTTATC TGATTTCATA GCGTAAGGAA TAGCAGTAGA AAAAGTCTGA 492 0 

ATCAAAAATG AAGTTCTCTC AAAAGTTAGA AATAAAACTC TGACTTTGGG AGAATTTCAT 4 98 0 

TTTATTATTA ATATATAAAA TTTCTTGACA TACAACTTAA AAAGAGGTGG AATATTTACT 504 0 

AGTTAATT (SEQ ID NO : 11) 5048 
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CAGAGATCTT AGTGAATCAA ATATACTTAA GAAAAGAGGA AAGAATGAAA ATCAATAAAA 60 

AATATCTAGC TGGGTCAGTA GCTACACTTG TTTTAAGTGT CTGTGCTTAT GAACTAGGTT 12 0 

TGCATCAAGC TCAAACTGTA AAAGAAAATA ATCGTGTTTC CTATATAGAT GGAAAACAAG 180 

CGACGCAAAA AACGGAGAAT TTGACTCCTG ATGAGGTTAG CAAGCGTGAA GGAATCAACG 240 

CCGAACAAAT CGTCATCAAG ATTACGGATC AAGGTTATGT GACCTCTCAT GGAGACCATT 3 00 

ATCATTACTA TAATGGCAAG GTCCCTTATG ATGCCATCAT CAGTGAAGAG CTCCTCATGA 3 60 

AAGATCCGAA TTATCAGTTG AAGGATTCAG ACATTGTCAA TGAAATCAAG GGTGGTTATG 42 0 

TCATTAAGGT AAACGGTAAA TACTATGTTT AC CTTAAGGA TGCAGCTCAT GCGGATAATG 4 80 

TCCGTACAAA AGAAGAAATC AATCGGCAAA AACAAGAACA TAGTCAGCAT CGTGAAGGAG 54 0 

GGACTTCAGC AAACGATGGT GCGGTAGCCT TTGCACGTTC ACAGGGACGC TAC AC C AC AG 600 

ATGATGGTTA TATCTTCAAT GCATCTGATA TCATCGAAGA TACGGGCGAT GCCTATATCG 660 

TTCCTCATGG AGATCATTAC CATTACATTC CTAAGAATGA GTTATCAGCT AGCGAGTTGG 72 0 

CTGCTGCAGA AGCCTTCCTA TCTGGTCGGG AAAATCTGTC AAATTTAAGA ACCTATCGCC 780 

GACAAAATAG CGATAACACT CCAAGAACAA ACTGGGTACC TTCTGTAAGC AATCCAGGAA 84 0 

CTACAAATAC TAACACAAGC AACAACAGCA ACACTAACAG TCAAGCAAGT CAAAGTAATG 900 

ACATTGATAG TCTCTTGAAA CAGCTCTACA AACTGCCTTT GAGTCAACGC CATGTAGAAT 960 

CTGATGGCCT TATTTTCGAC CCAGCGCAAA TCACAAGTCG AACCGCCAGA GGTGTAGCTG 102 0 

TCCCTCATGG TAACCATTAC CACTTTATCC CTTATGAACA AATGTCTGAA TTGGAAAAAC 10 8 0 

GAATTGCTCG TATTATTCCC CTTCGTTATC GTTCAAACCA TTGGGTACCA GATTCAAGAC 1140 

CAGAAGAACC AAGTCCACAA CCGACTCCAG AACCTAGTCC AAGTCCGCAA CCTGCACCAA 12 00 

ATCCTCAACC AGCTCCAAGC AATCCAATTG ATGAGAAATT GGTCAAAGAA GCTGTTCGAA 12 60 

AAGTAGGCGA TGGTTATGTC TTTGAGGAGA ATGGAGTTTC TCGTTATATC CCAGCCAAGA 132 0 

ATCTTTCAGC AGAAACAGCA GCAGGCATTG ATAGCAAACT GGCCAAGCAG GAAAGTTTAT 13 8 0 

CTCATAAGCT AGGAGCTAAG AAAACTGACC TCCCATCTAG TGATCGAGAA TTTTACAATA 144 0 

AGGCTTATGA CTTACTAGCA AGAATTCACC AAGATTTACT TGATAATAAA GGTCGACAAG 1500 

TTGATTTTGA GGCTTTGGAT AACCTGTTGG AACGACTCAA GGATGTCTCA AGTGATAAAG 1560 

TCAAGTTAGT GGATGATATT CTTGCCTTCT TAGCTCCGAT TCGTCATCCA GAACGTTTAG 162 0 

GAAAACCAAA TGCGCAAATT ACCTACACTG ATGATGAGAT TCAAGTAGCC AAGTTGGCAG 1680 

GCAAGTACAC AACAGAAGAC GGTTATATCT TTGATCCTCG TGATATAACC AGTGATGAGG 1740 

GGGATGCCTA TGTAACTCCA CATATGACCC ATAGCCACTG GATTAAAAAA GATAGTTTGT 18 00 

CTGAAGCTGA GAGAGCGGCA GCCCAGGCTT ATGCTAAAGA GAAAGGTTTG ACCCCTCCTT 18 60 

CGACAGACCA TCAGGATTCA GGAAATACTG AGGCAAAAGG AGCAGAAGCT ATCTACAACC 192 0 

GCGTGAAAGC AGCTAAGAAG GTGCCACTTG ATCGTATGCC TTACAATCTT CAATATACTG 1980 

TAGAAGTCAA AAACGGTAGT TTAATCATAC CTCATTATGA CC ATT AC CAT AACATCAAAT 2 04 0 

TTGAGTGGTT TGACGAAGGC CTTTATGAGG CACCTAAGGG GTATACTCTT GAGGATCTTT 210 0 

TGGCGACTGT CAAGTACTAT GTCGAACATC CAAACGAACG TCCGCATTCA GATAATGGTT 2160 

TTGGTAACGC TAGCGACCAT GTTCAAAGAA ACAAAAATGG TCAAGCTGAT ACCAATCAAA 22 2 0 

CGGAAAAACC AAGCGAGGAG AAAC CTCAGA C AG AAAAAC C TGAGGAAGAA ACCCCTCGAG 22 80 

AAGAGAAACC ACAAAGCGAG AAACCAGAGT CTCCAAAACC AACAGAGGAA CCAGAAGAAG 2340 

AATCACCAGA GGAATCAGAA GAACCTCAGG TCGAGACTGA AAAGGTTGAA GAAAAACTGA 24 00 

GAGAGGCTGA AGATTTACTT GGAAAAATCC AGGATC CAAT TATCAAGTCC AATGC CAAAG 2460 

AGACTCTCAC AGGATTAAAA AATAATTTAC TATTTGGCAC CCAGGACAAC AATACTATTA 252 0 

TGGCAGAAGC TGAAAAACTA TTGGCTTTAT TAAAGGAGAG TAAGTAAAGG TAGCAGCATT 25 80 

TTCTAACTCC TAAAAACAGG ATAGGAGAAC GGGAAAACGA AAAATGAGAG CAGAATGTGA 264 0 

GTTCTAG (SED ID NO : 12 ) 2 64 7 
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GGGTCTTAAA ACTCTGAATC CTTTAGAGGC AGACCCACAA AATGACAAGA CCTATTTAGA 60 

AAATCTGGAA GAAAATATGA GTGTTCTAGC AGAAGAATTA AAGTGAGGAA AGAATGAAAA 12 0 

TCAATAAAAA ATATCTAGCA GGTTCAGTGG CAGTCCTTGC CCTAAGTGTT TGTTCCTATG 180 

AACTTGGTCG TCACCAAGCT GGTCAGGTTA AGAAAGAGTC TAATCGAGTT TCTTATATAG 24 0 

ATGGTGATCA GGCTGGTCAA AAGGCAGAAA ATTTGACACC AGATGAAGTC AGTAAGAGAG 3 00 

AGGGGATCAA CGCCGAACAA ATTGTTATCA AGATTACGGA TCAAGGTTAT GTGACCTCTC 3 60 

ATGGAGACCA TTATCATTAC TATAATGGCA AGGTTCCTTA TGATGCCATC ATCAGTGAAG 42 0 

AACTTCTCAT GAAAGATCCG AATTATCAGT TGAAGGATTC AGACATTGTC AATGAAATCA 48 0 

AGGGTGGCTA TGTGATTAAG GTAGACGGAA AATACTATGT TTACCTTAAA GATGCGGCCC 540 

ATGCGGACAA TATTCGGACA AAAGAAGAGA TTAAACGTCA GAAGCAGGAA CACAGTCATA 600 

ATCATAACTC AAGAGCAGAT AATGCTGTTG CTGCAGCCAG AGCCCAAGGA CGTTATACAA 660 

CGGATGATGG GTATATCTTC AATGCATCTG ATATCATTGA GGACACGGGT GATGCTTATA 72 0 

TCGTTCCTCA CGGCGACCAT TACCATTACA TTCCTAAGAA TGAGTTATCA GCTAGCGAGT 780 

TAGCTGCTGC AGAAGCCTAT TGGAATGGGA AGCAGGGATC TCGTCCTTCT TCAAGTTCTA 840 

GTTATAATGC AAATCCAGTT CAACCAAGAT TGTCAGAGAA CCACAATCTG ACTGTCACTC 900 

CAACTTATCA TCAAAATCAA GGGGAAAACA TTTCAAGCCT TTTACGTGAA TTGTATGCTA 960 

AACCCTTATC AGAACGCCAT GTAGAATCTG ATGGCCTTAT TTTCGACCCA GCGCAAATCA 102 0 

CAAGTCGAAC CGCCAGAGGT GTAGCTGTCC CTCATGGTAA CCATTACCAC TTTATCCCTT 10 80 

ATGAACAAAT GTCTGAATTG GAAAAACGAA TTGCTCGTAT TATTCCCCTT CGTTATCGTT 1140 

CAAACCATTG GGTACCAGAT TCAAGACCAG AACAACCAAG TCCACAATCG ACTCCGGAAC 12 00 

CTAGTCCAAG TCTGCAACCT GCACCAAATC CTCAACCAGC TCCAAGCAAT CCAATTGATG 12 60 

AGAAATTGGT CAAAGAAGCT GTTCGAAAAG TAGGCGATGG TTATGTCTTT GAGGAGAATG 132 0 

GAGTTTCTCG TTATATCCCA GCCAAGGATC TTTCAGCAGA AACAGCAGCA GGCATTGATA 13 8 0 

GCAAACTGGC CAAGCAGGAA AGTTTATCTC ATAAGCTAGG AGCTAAGAAA ACTGACCTCC 144 0 

CATCTAGTGA TCGAGAATTT TACAATAAGG CTTATGACTT ACTAGCAAGA ATTCACCAAG 150 0 

ATTTACTTGA TAATAAAGGT CGACAAGTTG ATTTTGAGGT TTTGGATAAC CTGTTGGAAC 156 0 

GACTCAAGGA TGTCTCAAGT GATAAAGTCA AGTTAGTGGA TGATATTCTT GCCTTCTTAG 162 0 

CTCCGATTCG TCATCCAGAA CGTTTAGGAA AACCAAATGC GCAAATTACC TACACTGATG 168 0 

ATGAGATTCA AGTAGCCAAG TTGGCAGGCA AGTACACAAC AGAAGACGGT TATATCTTTG 1740 

ATCCTCGTGA TATAACCAGT GATGAGGGGG ATGCCTATGT AACTCCACAT ATGAC C CAT A 1800 

GCCACTGGAT TAAAAAAGAT AGTTTGTCTG AAGCTGAGAG AGCGGCAGCC CAGGCTTATG 1860 

CTAAAGAGAA AGGTTTGACC CCTCCTTCGA CAGACCACCA GGATTCAGGA AATACTGAGG 1920 

CAAAAGGAGC AGAAGCTATC TACAACCGCG TGAAAGCAGC TAAGAAGGTG CCACTTGATC 1980 

GTATGCCTTA CAATCTTCAA TATACTGTAG AAGTCAAAAA CGGTAGTTTA ATCATACCTC 2 04 0 

ATTATGACCA TTACCATAAC ATCAAATTTG AGTGGTTTGA CGAAGGCCTT TATGAGGCAC 2100 

CTAAGGGGTA TAGTCTTGAG GATCTTTTGG CGACTGTCAA GTACTATGTC GAACATCCAA 2160 

ACGAACGTCC GCATTCAGAT AATGGTTTTG GTAACGCTAG TGACCATGTT CGTAAAAATA 222 0 

AGGCAGACCA AGATAGTAAA CCTGATGAAG ATAAGGAACA TGATGAAGTA AGTGAGCCAA 22 8 0 

CTCACCCTGA ATCTGATGAA AAAGAGAATC ACGCTGGTTT AAATCCTTCA GCAGATAATC 234 0 

TTTATAAACC AAGCACTGAT ACGGAAGAGA CAGAGGAAGA AGCTGAAGAT ACCACAGATG 24 0 0 

AGGCTGAAAT TCCTCAAGTA GAGAATTCTG TTATTAACGC TAAGATAGCA GATGCGGAGG 2460 

CCTTGCTAGA AAAAGTAACA GATCCTAGTA TTAGACAAAA TGCTATGGAG ACATTGACTG 252 0 

GTCTAAAAAG TAGTCTTCTT CTCGGAACGA AAGATAATAA CACTATTTCA GCAGAAGTAG 2580 

ATAGTCTCTT GGCTTTGTTA AAAGAAAGTC AACCGGCTCC TATACAGTAG TAAAATGAA 263 9 
(SEQ ID NO : 13) 
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MKINKKYLAG SVAVLALSVC SYELGRHQAG 
AENLTPDEVS KREGINAEQI VIKITDQGYV 
SEELLMKDPN YQLKDSDIW EIKGGYVIKV 
EEIKRQKQEH SHNHNSRADN AVAAARAQGR 
AYIVPHGDHY HYIPKNELSA SELAAAEAYW 
PRLSENHNLT VTPTYHQNQG ENISSLLREL 
QITSRTARGV AVPHGNHYHF IPYEQMSELE 
RPEQPSPQST PEPSPSLQPA PNPQPAPSNP 
ENGVSRYIPA KDLSAETAAG IDSKLAKQES 
NKAYDLLARI HQDLLDNKGR QVDFEVLDNL 
FLAPIRHPER LGKPNAQITY TDDEIQVAKL 
EGDAYVTPHM THSHWIKKDS LSEAERAAAQ 
TEAKGAEAIY NRVKAAKKVP LDRMPYNLQY 
KFEWFDEGLY EAPKGYSLED LLATVKYYVE 
KNKADQDSKP DEDKEHDEVS EPTHPESDEK 
EETEEEAEDT TDEAEIPQVE NSVINAKIAD 
LTGLKSSLLL GTKDNNTISA EVDSLLALLK 
{SEQ ID NO : 14) 



QVKKESNRVS 


YIDGDQAGQK 


50 


TSHGDHYHYY 


NGKVP YDAI I 


10 0 


DGKYYVYLKD 


7\ t\ TT7\ T^VTT Tl rni/ 

AAHADN I RTK 


lb 0 


YTTDDGYIFN 


ASDIIEDTGD 


2 00 


NGKQGSRPSS 


SSSYNANPVQ 


250 


YAKPLSERHV 


ESDGLIFDPA 


300 


KRIARIIPLR 


YRSNHWVPDS 


350 


IDEKLVKEAV 


RKVGDGYVFE 


400 


LSHKLGAKKT 


DLPSSDREFY 


450 


LERLKDVSSD 


KVKLVDDILA 


500 


AGKYTTEDGY 


IFDPRDITSD 


550 


AYAKEKGLTP 


PSTDHQDSGN 


600 


TVEVKNGSLI 


IPHYDHYHNI 


650 


HPNERPHSDN 


GFGNASDHVR 


700 


ENHAGLNPSA 


DNLYKPSTDT 


750 


AEALLEKVTD 


PS IRQNAMET 


800 


ESQPAPIQ 




838 



FIGURE 17 



TGTGCCTATG CACTAAACCA GCATCGTTCG CAGGAAAATA AGGACAATAA TCGTGTCTCT 60 

TATGTGGATG GCAGCCAGTC AAGT CAGAAA AGTGAAAACT TGACACCAGA CCAGGTTAGC 12 0 

CAGAAAGAAG GAATTCAGGC TGAGCAAATT GTAATCAAAA TTACAGATCA GGGCTATGTA 180 

ACGTCACACG GTGATCACTA TCATTACTAT AATGGGAAAG TTCCTTATGA TGCCCTCTTT 24 0 

AGTGAAGAAC TCTTGATGAA GGATCCAAAC TATCAACTTA AAGACGCTGA TATTGTCAAT 3 00 

GAAGTCAAGG GTGGTTATAT CATCAAGGTC GATGGAAAAT ATTATGTCTA CCTGAAAGAT 360 

GCAGCTCATG CTGATAATGT TCGAACTAAA GATGAAATCA ATCGTCAAAA ACAAGAACAT 42 0 

GTCAAAGATA ATGAGAAGGT TAACTCTAAT GTTGCTGTAG CAAGGTCTCA GGGACGATAT 48 0 

ACGACAAATG ATGGTTATGT CTTTAATCCA GCTGATATTA TCGAAGATAC GGGTAATGCT 54 0 

TATATCGTTC CTCATGGAGG TCACTATCAC TACATTCCCA AAAGCGATTT ATCTGCTAGT 60 0 

GAATTAGCAG CAGCTAAAGC ACATCTGGCT GGAAAAAATA TGCAACCGAG TCAGTTAAGC 660 

TATTCTTCAA CACCTTCTCC ATCTCTTCCA ATCAATCCAG GAACTTCACA TGAGAAACAT 72 0 

GAAGAAGATG GATACGGATT TGATGCTAAT CGTATTATCG CTGAAGATGA ATCAGGTTTT 780 

GTCATGAGTC ACGGAGAC C A CAATCATTAT TTCTTCAAGA AGGACTTGAC AGAAGAGCAA 84 0 

ATTAAGGCTG CGCAAAAACA TTTAGAGGAA GTTAAAACTA GTCATAATGG ATTAGATTCT 900 

TTGTCATCTC ATGAACAGGA TTATCCAAGT AATGC CAAAG AAATGAAAGA TTTAGATAAA 960 

AAAATCGAAG AAAAAATTGC TGGCATTATG AAACAATATG GTGTCAAACG TGAAAGTATT 102 0 

GTCGTGAATA AAGAAAAAAA TGCGATTATT TATCCGCATG GAGATCACCA TCATGCAGAT 1080 

CCGATTGATG AACATAAACC GGTTGGAATT GGTCATTCTC ACAGTAACTA TGAACTGTTT 114 0 

AAACCCGAAG AAGGAGTTGC TAAAAAAGAA GGGAATAAAG TTTATACTGG AGAAGAATTA 12 00 

ACGAATGTTG TTAATTTGTT AAAAAATAGT ACGTTTAATA ATCAAAACTT TACTCTAGCC 12 60 

AATGGTCAAA AACGCGTTTC TTTTAGTTTT CCGCCTGAAT TGGAGAAAAA ATTAGGTATC 132 0 

AATATGCTAG TAAAATTAAT AACACCAGAT GGAAAAGTAT TGGAGAAAGT ATCTGGTAAA 13 80 

GTATTTGGAG AAGGAGTAGG GAATATTGCA AACTTTGAAT TAGATCAACC TTATTTACCA 1440 

GGACAAACAT TTAAGTATAC TATCGCTTCA AAAGATTATC CAGAAGTAAG TTATGATGGT 1500 

ACATTTACAG TTCCAACCTC TTTAGCTTAC AAAATGGCCA GTCAAACGAT TTTCTATGCT 1560 

TTCCATGCAG GGGATACTTA TTTAAGAGTG AACCCTCAAT TTGCAGTGCC TAAAGGAACT 162 0 

GATGCTTTAG TCAGAGTGTT TGATGAATTT CATGGAAATG CTTATTTAGA AAATAACTAT 168 0 

AAAGTTGGTG AAATCAAATT ACCGATTCCG AAATTAAACC AAGGAACAAC CAGAACGGCC 1740 

GGAAATAAAA TTCCTGTAAC CTTCATGGCA AATGCTTATT TGGACAATCA ATCGACTTAT 180 0 

ATTGTGGAAG TACCTATCTT GGAAAAAGAA AATCAAACTG ATAAACCAAG TATTCTACCA 1860 

CAATTTAAAA GGAATAAAGC ACAAGAAAAC TCAAAACTTG ATGAAAAGGT AGAAGAACCA 192 0 

AAG AC TAGTG AGAAGGTAGA AAAAGAAAAA CTTTCTGAAA CTGGGAATAG TACTAGTAAT 1980 

TCAACGTTAG AAGAAGTTCC TACAGTGGAT CCTGTACAAG AAAAAGTAGC AAAATTTGCT 2 04 0 

GAAAGTTATG GGATGAAGCT AGAAAATGTC TTGTTTAATA TGGACGGAAC AATTGAATTA 210 0 

TATTTACCAT CGGGAGAAGT CATTAAAAAG AATATGGCAG ATTTTACAGG AGAAGCACCT 2160 

CAAGGAAATG GTGAAAATAA ACCATCTGAA AATGGAAAAG TATCTACTGG AACAGTTGAG 222 0 

AACCAACCAA CAGAAAATAA ACCAGCAGAT TCTTTACCAG AGGCACCAAA CGAAAAACCT 22 80 

GTAAAACCAG AAAACTCAAC GGATAATGGA ATGTTGAATC CAGAAGGGAA TGTGGGGAGT 234 0 

GACCCTATGT TAGATTCAGC ATTAGAGGAA GCTCCAGCAG TAGATCCTGT ACAAGAAAAA 24 00 

TTAGAAAAAT TTACAGCTAG TTACGGATTA GGCTTAGATA GTGTTATATT CAATATGGAT 2460 

GGAACGATTG AATTAAGATT GCCAAGTGGA GAAGTGATAA AAAAGAATTT ATTGATCTCA 252 0 

TAGCGTAA (SEQ ID NO : 15) 2528 



FIGURE 18 



V 



CAYALNQHRS QENKDNNRVS YVDGSQSSQK SENLTPDQVS QKEGIQAEQI 50 

VIKITDQGYV TSHGDHYHYY NGKVPYDALF SEELLMKDPN YQLKDADIVN 100 

EVKGGYIIKV DGKYYVYLKD AAHADNVRTK DEINRQKQEH VKDNEKVNSN 150 

VAVARSQGRY TTNDGYVFNP ADIIEDTGNA YIVPHGGHYH YIPKSDLSAS 2 00 

ELAAAKAHLA GKNMQPSQLS YSSTPSPSLP INPGTSHEKH EEDGYGFDAN 2 50 

RIIAEDESGF VMSHGDHNHY FFKKDLTEEQ IKAAQKHLEE VKTSHNGLDS 3 00 

LSSHEQDYPS NAKEMKDLDK KIEEKIAGIM KQYGVKRESI WNKEKNAII 350 

YPHGDHHHAD PIDEHKPVGI GHSHSNYELF KPEEGVAKKE GNKVYTGEEL 4 00 

TNVWLLKNS TFNNQNFTLA NGQKRVSFSF PPELEKKLGI NMLVKLITPD 450 

GKVLEKVSGK VFGEGVGNIA NFELDQPYLP GQTFKYTIAS KDYPEVSYDG 500 

TFTVPTSLAY KMASQTIFYP FHAGDTYLRV NPQFAVPKGT DALVRVFDEF 550 

HGNAYLENNY KVGEIKLPIP KLNQGTTRTA GNKIPVTFMA NAYLDNQSTY 600 

IVEVPILEKE NQTDKPSILP QFKRNKAQEN SKLDEKVEEP KTSEKVEKEK 650 

LSETGNSTSN STLEEVPTVD PVQEKVAKFA ESYGMKLENV LFNMDGTIEL 7 00 

YLPSGEVIKK NMADFTGEAP QGNGENKPSE NGKVSTGTVE NQPTENKPAD 750 

SLPEAPNEKP VKPENSTDNG MLNPEGNVGS DPMLDSALEE APAVDPVQEK 8 00 

LEKFTASYGL GLDSVIFNMD GTIELRLPSG EVIKKNLLIS 84 0 
(SEQ ID NO : 16) 

FIGURE 19 

CAYALNQHRS QENKDNNRVS YVDGSQSSQK SENLTPDQVS QKEGIQAEQI 50 

VIKITDQGYV TSHGDHYHYY NGKVPYDALF SEELLMKDPN YQLKDADIVN 100 

EVKGGYIIKV DGKYYVYLKD AAHADNVRTK DEINRQKQEH VKDNEKVNSN 150 

VAVARSQGRY TTNDGYVFNP ADIIEDTGNA YIVPHGGHYH YIPKSDLSAS 200 

ELAAAKAHLA GKNMQPSQLS YSSTASDNNT QSVAKGSTSK PANKSENLQS 250 

LLKELYDSPS AQRYSESDGL VFDPAKIISR TPNGVAIPHG DHYHFIPYSK 3 00 

LSALEEKIAR MVPISGTGST VSTNAKPNEV VSSLGSLSSN PSSLTTSKEL 35 0 

SSASDGYIFN PKDIVEETAT AYIVRHGDHF HYIPKSNQIG QPTLPNNSLA 4 00 

TPSPSLPINP GTSHEKHEED GYGFDANRII AEDESGFVMS HGDHNHYFFK 450 

KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS HEQDYPGNAK EMKDLDKKIE 50 0 

EKIAGIMKQY GVKRESIWN KEKNAIIYPH GDHHHADPID EHKPVGIGHS 55 0 

HSNYELFKPE EGVAKKEGNK VYTGEELTNV VNLLKNSTFN NQNFTLANGQ 600 

KRVSFSFPPE LEKKLGINML VKLITPDGKV LEKVSGKVFG EGVGNIANFE 65 0 

LDQPYLPGQT FKYTIASKDY PEVSYDGTFT VPTSLAYKMA SQTIFYPFHA 70 0 

GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN AYLENNYKVG EIKLPIPKLN 75 0 

QGTTRTAGNK IPVTFMANAY LDNQSTYIVE VPILEKENQT DKPSILPQFK 800 

RNKAQENSKL DEKVEEPKTS EKVEKEKLSE TGNSTSNSTL EEVPTVDPVQ 850 

EKVAKFAESY GMKLENVLFN MDGTIELYLP SGEVIKKNMA DFTGEAPQGN 90 0 

GENKPSENGK VSTGTVENQP TENKPADSLP EAPNEKPVKP ENSTDNGMLN 950 

PEGNVGSDPM LDPALEEAPA VDPVQEKLEK FTASYGLGLD SVIFNMDGTI 1000 

ELRLPSGEVI KKNLSDFIA (SEQ ID NO : 55) 1019 



FIGURE 2 0 



CAYALNQHRS QENKDNNRVS YVDGSQSSQK 
VIKITDQGYV TSHGDHYHYY NGKVPYDALF 
EVKGGYIIKV DGKYYVYLKD AAHADNVRTK 
VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 
ELAAAKAHLA GKNMQPSQLS YSSTASDNNT 
LLKELYDSPS AQRYSESDGL VFDPAKIISR 
LSALEEKIAR MVPISGTGST VSTNAKPNEV 
SSASDGYIFN PKDIVEETAT AYIVRHGDHF 
TPSPSLPINP GTSHEKHEED GYGFDANRII 
KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS 
(SEQ ID NO : 56) 



SENLTPDQVS 


QKEGIQAEQI 


50 


SEELLMKDPN 


YQLKDADIVN 


100 


DEINRQKQEH 


VKDNEKVNSN 


150 


YIVPHGGHYH 


YIPKSDLSAS 


200 


QSVAKGSTSK 


PANKSENLQS 


250 


TPNGVAIPHG 


DHYHFIPYSK 


300 


VSSLGSLSSN 


PSSLTTSKEL 


350 


HYIPKSNQIG 


QPTLPNNSLA 


400 


AEDESGFVMS 


HGDHNHYFFK 


450 


HEQDYPGNA 




489 



FIGURE 21 



MKFSKKYIAA GSAVIVSLSL CAYALNQHRS QENKDNNRVS YVDGSQSSQK SENLTPDQVS 60 

QKEGIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF SEELLMKDPN YQLKDADIVN 12 0 

EVKGGYIIKV DGKYYVYLKD AAHADNVRTK DEINRQKQEH VKDNEKVNSN VAVARSQGRY 18 0 

TTNDGYVFNP ADIIEDTGNA YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA GKNMQPSQLS 240 

YSSTASDNNT QSVAKGSTSK PANKSENLQS LLKELYDSPS AQRYSESDGL VFDPAKIISR 3 00 

TPNGVAIPHG DHYHFIPYSK LSALEEKIAR MVPISGTGST VSTNAKPNEV VSSLGSLSSN 3 60 

PSSLTTSKEL SSASDGYIFN PKDIVEETAT AYIVRHGDHF HYIPKSNQIG QPTLPNNSLA 42 0 

TPSPSLPINP GTSHEKHEED GYGFDANRII AEDESGFVMS HGDHNHYFFK KDLTEEQIKA 4 80 

AQKHLEEVKT SHNGLDSLSS HEQDYPGNA (SEQ ID NO : 57) 50 9 



FIGURE 22 



DLTEEQIKAA QKHLEEVKTS HNGLDSLSSH 
KIAGIMKQYG VKRESIWNK EKNAIIYPHG 
SNYELFKPEE GVAKKEGNKV YTGEELTNW 
RVSFSFPPEL EKKLGINMLV KLITPDGKVL 
DQPYLPGQTF KYTIASKDYP EVSYDGTFTV 
DTYLRVNPQF AVPKGTDALV RVFDEFHGNA 
GTTRTAGNKI PVTFMANAYL DNQSTYIVEV 
NKAQENSKLD EKVEEPKTSE KVEKEKLSET 
KVAKFAESYG MKLENVLFNM DGTIELYLPS 
ENKPSENGKV STGTVENQPT ENKPADSLPE 
EGNVGSDPML D P ALE EAPAV DPVQEKLEKF 
LRLPSGEVIK KNLSDFIAKL RYRSNHWPD 
APNPQPAPSN PIDEKLVKEA VRKVGDGYVF 
GIDSKLAKQE SLSHKLGAKK TDLPSSDREF 
RQVDFEALDN LLERLKDVSS DKVKLVDDIL 
YTDDEIQVAK LAGKYTTEDG YIFDPRDITS 
SLSEAERAAA QAYAKEKGLT PPSTDHQDSG 
PLDRMPYNLQ YTVEVKNGSL IIPHYDHYHN 
DLLATVKYYV EHPNERPHSD NGFGNASDHV 
PQTEKPEEET PREEKPQSEK PESPKPTEEP 
KLREAEDLLG KIQDPIIKSN AKETLTGLKN 
ALLKESK (SEQ ID NO : 58) 



EQDYPGNAKE 


MKDLDKKIEE 


50 


DHHHADPIDE 


HKPVGIGHSH 


100 


NLLKNSTFNN 


QNFTLANGQK 


150 


EKVSGKVFGE 


GVGNIANFEL 


200 


PTSLAYKMAS 


QTIFYPFHAG 


250 


YLENNYKVGE 


IKLPIPKLNQ 


300 


PILEKENQTD 


KPSILPQFKR 


350 


GNSTSNSTLE 


EVPTVDPVQE 


400 


GEVIKKNMAD 


FTGEAPQGNG 


450 


APNEKPVKPE 


NSTDNGMLNP 


500 


TASYGLGLDS 


VIFNMDGTIE 


550 


SRPEEPSPQP 


TPEPSPSPQP 


600 


EENGVSRYIP 


AKNLSAETAA 


650 


YNKAYDLLAR 


IHQDLLDNKG 


700 


AFLAPIRHPE 


RLGKPNAQIT 


750 


DEGDAYVTPH 


MTHSHWIKKD 


800 


NTEAKGAEAI 


YNRVKAAKKV 


850 


IKFEWFDEGL 


YEAPKGYTLE 


900 


QRNKNGQADT 


NQTEKPSEEK 


950 


EEESPEESEE 


PQVETEKVEE 


1000 


NLLFGTQDNN 


TIMAEAEKLL 


1050 






1057 



FIGURE 23 



CAYALNQHRS QENKDNNRVS YVDGSQSSQK SENLTPDQVS QKEGIQAEQI 50 

VIKITDQGYV TSHGDHYHYY NGKVPYDALF SEELLMKDPN YQLKDADIVN 100 

EVKGGYIIKV DGKYYVYLKD AAHADNVRTK DEINRQKQEH VKDNEKVNSN 150 

VAVARSQGRY TTNDGYVFNP ADIIEDTGNA YIVPHGGHYH YIPKSDLSAS 2 00 

ELAAA (SEQ ID NO : 59) 2 05 

FIGURE 24 

CAYELGLHQA QTVKENNRVS YIDGKQATQK TENLTPDEVS KREGINAEQI 50 

VIKITDQGYV TSHGDHYHYY NGKVP YDAI I SEELLMKDPN YQLKDSDIVN 100 

EIKGGYVIKV NGKYYVYLKD AAHADNVRTK EEINRQKQEH SQHREGGTSA 150 

NDGAVAFARS QGRYTTDDGY IFNASDIIED TGDAYIVPHG DHYHYIPKNE 2 00 

LSASELAAAE AFLSGRENLS NLRTYRRQNS DNTPRTNWVP SVSNPGTTNT 250 

NTSNNSNTNS QASQSNDIDS LLKQLYKLPL SQRHVESDGL IFDPAQITSR 3 00 

TARGVAVPHG NHYHFIPYEQ MSELEKRIAR IIPLRYRSNH WVPDSRPEEP 350 

SPQPTPEPSP SPQPAPNPQP APSNPIDEKL VKEAVRKVGD GYVFEENGVS 4 00 

RYIPAKNLSA ETAAGIDSKL AKQESLSHKL GAKKTDLPSS DREFYNKAYD 450 

LLARIHQDLL DNKGRQVDFE ALDNLLERLK DVSSDKVKLV DDILAFLAPI 5 00 

RHPERLGKPN AQITYTDDEI QVAKLAGKYT TEDGYIFDPR DITSDEGDAY 550 

VTPHMTHSHW IKKDSLSEAE RAAAQAYAKE KGLTPPSTDH QDSGNTEAKG 600 

AEAIYNRVKA AKKVPLDRMP YNLQYTVEVK NGSLIIPHYD HYHNIKFEWF 650 

DEGLYEAPKG YTLEDLLATV KYYVEHPNER PHSDNGFGNA SDHVQRNKNG 700 

QADTNQTEKP SEEKPQTEKP EEETPREEKP QSEKPESPKP TEEPEEESPE 750 

ESEEPQVETE KVEEKLREAE DLLGKIQDPI IKSNAKETLT GLKNNLLFGT 80 0 

QDNNTIMAEA EKLLALLKES K {(SEQ ID NO : 60) 821 

FIGURE 25 

CAYELGLHQA QTVKENNRVS YIDGKQATQK TENLTPDEVS KREGINAEQI 5 0 

VIKITDQGYV TSHGDHYHYY NGKVPYDAI I SEELLMKDPN YQLKDSDIVN 100 

EIKGGYVIKV NGKYYVYLKD AAHADNVRTK EEINRQKQEH SQHREGGTSA 15 0 

NDGAVAFARS QGRYTTDDGY IFNASDIIED TGDAYIVPHG DHYHYIPKNE 2 00 

LSASELAAAE AFLSGRENLS NLRTYRRQNS DNTPRTNWVP SVSNPGTTNT 25 0 

NTSNNSNTNS QASQSNDIDS LLKQLYKLPL SQRHVESDGL IFDPAQITSR 3 00 

TARGVAVPHG NHYHFIPYEQ MSELEKRIAR IIPL 334 
(SEQ ID NO : 61) 

FIGURE 2 6 

RYRSNHWVPD SRPEEPSPQP TPEPSPSPQP APNPQPAPSN PIDEKLVKEA 50 

VRKVGDGYVF EENGVSRYIP AKNLSAETAA GIDSKLAKQE SLSHKLGAKK 100 

TDLPSSDREF YNKAYDLLAR IHQDLLDNKG RQVDFEALDN LLERLKDVSS 150 

DKVKLVDDIL AFLAPIRHPE RLGKPNAQIT YTDDEIQVAK LAGKYTTEDG 2 00 

YIFDPRDITS DEGDAYVTPH MTHSHWIKKD SLSEAERAAA QAYAKE KGLT 250 

PPSTDHQDSG NTEAKGAEAI YNRVKAAKKV PLDRMPYNLQ YTVEVKNGSL 3 00 

IIPHYDHYHN IKFEWFDEGL YEAPKGYTLE DLLATVKYYV EHPNERPHSD 350 

NGFGNASDHV QRNKNGQADT NQTEKPSEEK PQTEKPEEET PREEKPQSEK 400 

PESPKPTEEP EEESPEESEE PQVETEKVEE KLREAEDLLG KIQDPIIKSN 450 

AKETLTGLKN NLLFGTQDNN TIMAEAEKLL ALLKESK 487 
(SEQ ID NO : 62) 

FIGURE 2 7 



AEAFLSGREN LSNLRTYRRQ NSDNTPRTNW VPSVSNPGTT NTNTSNNSNT 50 

NSQASQSNDI DSLLKQLYKL PLSQRHVESD GLIFDPAQIT SRTARGVAVP 100 

HGNHYHFIPY EQMSELEKRI ARIIPLRYRS NHWVPDSRPE EPSPQPTPEP 150 

SPSPQPAPNP QPAPSNPIDE KLVKEAVRKV GDGYVFEENG VSRYIPAKNL 2 00 

SAETAAGIDS KLAKQESLSH KLGAKKTDLP SSDREFYNKA YDLLARIHQD 250 

LLDNKGRQVD FEALDNLLER LKDVSSDKVK LVDDILAFLA PIRHPERLGK 3 00 

PNAQITYTDD EIQVAKLAGK YTTEDGYIFD PRDITSDEGD AYVTPHMTHS 3 50 

HWIKKDSLSE AERAAAQAYA KEKGLTPPST DHQDSGNTEA KGAEAIYNRV 4 00 

KAAKKVPLDR MPYNLQYTVE VKNGSLIIPH YDHYHNIKFE WFDEGLYEAP 450 

KGYTLEDLLA TVKYYVEHPN ERPHSDNGFG NASDHVQRNK NGQADTNQTE 500 

KPSEEKPQTE KPEEETPREE KPQSEKPESP KPTEEPEEES PEESEEPQVE 550 

TEKVEEKLRE AEDLLGKIQD PIIKSNAKET LTGLKNNLLF GTQDNNTIMA 600 

EAEKLLALLK ESK (SEQ ID NO : 63) 613 

FIGURE 2 8 

DLTEEQIKAA QKHLEEVKTS HNGLDSLSSH EQDYPGNAKE MKDLDKKIEE 5 0 

KIAGIMKQYG VKRESIVWK EKNAIIYPHG DHHHADPIDE HKPVGIGHSH 100 

SNYELFKPEE GVAKKEGNKV YTGEELTNW NLLKNSTFNN QNFTLANGQK 150 

RVSFSFPPEL EKKLGINMLV KLITPDGKVL EKVSGKVFGE GVGNIANFEL 2 00 

DQPYLPGQTF KYTIASKDYP EVSYDGTFTV PTSLAYKMAS QTIFYPFHAG 2 50 

DTYLRVNPQF AVPKGTDALV RVFDEFHGNA YLENNYKVGE IKLPIPKLNQ 3 00 

GTTRTAGNKI PVTFMANAYL DNQSTYIVEV PILEKENQTD KPSILPQFKR 35 0 

NKAQENSKLD EKYEEPKTSE KVEKEKLSET GNSTSNSTLE EVPTVDPVQE 40 0 

KVAKFAESYG MKLENVLFNM DGTIELYLPS GE V I KKNMAD FTGEAPQGNG 45 0 

ENKPSENGKV STGTVENQPT ENKPADSLPE APNEKPVKPE NSTDNGMLNP 50 0 

EGNVGSDPML DPALEEAPAV DPVQEKLEKF TASYGLGLDS VIFNMDGTIE 550 

LRLPSGEVIK KNLSDFIA (SEQ ID NO : 64) 568 

FIGURE 2 9 

DLTEEQIKAA QKHLEEVKTS HNGLDSLSSH EQDYPGNAKE MKDLDKKIEE 50 

KIAGIMKQYG VKRESIWNK EKNAIIYPHG DHHHADPIDE HKPVGIGHSH 100 

SNYELFKPEE GVAKKEGNKV YTGEELTNW NLLKNSTFNN QNFTLANGQK 150 

RVSFSFPPEL EKKLGINMLV KLITPDGKVL EKVSGKVFGE GVGNIANFEL 2 00 

DQPYLPGQTF KYTIASKDYP EVSYDGTFTV PTSLAYKMAS QTIFYPFHAG 250 

DTYLRVNPQF AVPKGTDALV RVFDEFHGNA YLENNYKVGE IKLPIPKLNQ 3 00 

GTTRTAGNKI PVTFMANAYL DNQSTYIVE (SEQ ID NO : 65) 32 9 

FIGURE 3 0 



EVPILEKENQ TDKPSILPQF KRNKAQENSK LDEKVEEPKT SEKVEKEKLS 50 

ETGNSTSNST LEEVPTVDPV QEKVAKFAES YGMKLENVLF NMDGTIELYL 100 

PSGEVIKKNM ADFTGEAPQG NGENKPSENG KVSTGTVENQ PTENKPADSL 15 0 

PEAPNEKPVK PENSTDNGML NPEGNVGSDP MLDPALEEAP AVDPVQEKLE 2 00 

KFTASYGLGL DSVIFNMDGT IELRLPSGEV IKKNLSDFIA 24 0 
(SEQ ID NO : 66) 

FIGURE 31 

DIDSLLKQLY KLPLSQRHVE SDGLIFDPAQ ITSRTARGVA VPHGNHYHFI 50 

PYEQMSELEK RIARIIPLRY RSNHWVPDSR PEEPSPQPTP EPSPSPQPAP 100 

NPQPAPSNPI DEKLVKEAVR KVGDGYVFEE NGVSRYIPAK NLSAETAAGI 15 0 

DSKLAKQESL SHKLGAKKTD LPSSDREFYN KAYDLLARIH QDLLDNKGRQ 2 00 

VDFEALDNLL ERLKDVSSDK VKLVDDILAF LAP I RH PERL GKPNAQITYT 2 50 

DDEIQVAKLA GKYTTEDGYI FDPRDITSDE GDAYVTPHMT HSHWIKKDSL 3 00 

SEAERAAAQA YAKEKGLTPP STDHQDSGNT EAKGAEAIYN RVKAAKKVPL 3 50 

DRMPYNLQYT VEVKNGSLII PHYDHYHNIK FEWFDEGLYE APKGYTLEDL 400 

LATVKYYVEH PNERPHSDNG FGNASDHVQR NKNGQADTNQ TEKPSEEKPQ 45 0 

TEKPEEETPR EEKPQSEKPE SPKPTEEPEE ESPEESEEPQ VETEKVEEKL 50 0 

REAEDLLGKI QDPIIKSNAK ETLTGLKNNL LFGTQDNNTI MAEAEKLLAL 55 0 

LKESK (SEQ ID NO : 67) 555 

FIGURE 32 

DIDSLLKQLY KLPLSQRHVE SDGLIFDPAQ ITSRTARGVA VPHGNHYHFI 50 

PYEQMSELEK RIARIIPLRY RSNHWVPDSR PEEPSPQPTP EPSPSPQPAP 100 

NPQPAPSNPI DEKLVKEAVR KVGDGYVFEE NGVSRYIPAK NLSAETAAGI 150 

DSKLAKQESL SHKLGAKKTD LPSSDREFYN KAYDLLARIH QDLLDNKGRQ 2 00 

VDFEALDNLL ERLKDVSSDK VKLVDDILAF LAPIRHPERL GKPNAQITYT 2 50 

DDEIQVAKLA GKYTTEDGYI FDPRDITSDE GDAYVTPHMT HSHWIKKDSL 3 00 

SEAERAAAQA YAKEKGLTPP STDHQDSGNT EAKGAEAIYN RVKAAKKVPL 350 

DRMPYNLQYT VEVKNGSLII PHYDHYHNIK FEWFDEGLYE APKGYTLEDL 400 

LATVKYYVEH PNERPHSDNG FGNASDHV (SEQ ID NO : 6 8) 42 8 

FIGURE 3 3 

GLYEAPKGYT LEDLLATVKY YVEHPNERPH SDNGFGNASD HVQRNKNGQA 50 

DTNQTEKPSE EKPQTEKPEE ETPREEKPQS EKPESPKPTE EPEEESPEES 100 

EEPQVETEKV EEKLREAEDL L (SEQ ID NO : 69) 121 

FIGURE 34 

ASDHVQRNKN GQADTNQTEK PSEEKPQTEK PEEETPREEK PQSEKPESPK 5 0 

PTEEPEEESP EESEEPQVET EKVEEKLREA EDLLGKIQDP IIKSNAKETL 10 0 

TGLKNNLLFG TQDNNTIMAE AEKLLALLKE SK 132 
(SEQ ID NO : 70) 

FIGURE 35 



DIDSLLKQLY KLPLSQRHVE SDGLIFDPAQ ITSRTARGVA VPHGNHYHFI 50 

PYEQMSELEK RIARIIPLRY RSNHWVPDSR PEEPSPQPTP EPSPSPQPAP 100 

NPQPAPSNPI DEKLVKEAVR KVGDGYVFEE NGVSRYIPAK NLSAETAAGI 150 

DSKLAKQESL SHKLGAKKTD LPSSDREFYN KAYDLLARIH QDLLDNKGRQ 2 00 

VDFEALDNLL ERLKDVSSDK VKLVDD (SEQ ID NO : 71) 22 6 

FIGURE 3 6 

DILAFLAPIR HPERLGKPNA QITYTDDEIQ VAKLAGKYTT EDGYIFDPRD 50 

ITSDEGDAYV TPHMTHSHWI KKDSLSEAER AAAQAYAKEK GLTPPSTDHQ 100 

DSGNTEAKGA EAIYNRVKAA KKVPLDRMPY NLQYTVEVKN GSLIIPHYDH 150 

YHNIKFEWFD EGLYEAPKGY TLEDLLATVK YYVEHPNERP HSDNGFGNAS 2 00 

DHV (SEQ ID NO : 72) 2 03 

FIGURE 3 7 

CSYELGRHQA GQVKKESNRV SYIDGDQAGQ KAENLTPDEV SKREGINAEQ 50 

IVIKITDQGY VTSHGDHYHY YNGKVPYDAI ISEELLMKDP NYQLKDSDIV 100 

NEIKGGYVIK VDGKYYVYLK DAAHADNIRT KEEIKRQKQE HSHNHNSRAD 150 

NAVAAARAQG RYTTDDGYIF NASDIIEDTG DAYIVPHGDH YHYIPKNELS 2 00 

ASELAAAEAY WNGKQGSRPS SSSSYNANPV QPRLSENHNL TVTPTYHQNQ 2 50 

GENISSLLRE LYAKPLSERH VESDGLIFDP AQITSRTARG VAVPHGNHYH 3 00 

FIPYEQMSEL EKRIARIIPL RYRSNHWVPD SRPEQPSPQS TPEPSPSLQP 3 50 

APNPQPAPSN PIDEKLVKEA VRKVGDGYVF EENGVSRYIP AKDLSAETAA 4 00 

GIDSKLAKQE SLSHKLGAKK TDLPSSDREF YNKAYDLLAR IHQDLLDNKG 4 50 

RQVDFEVLDN LLERLKDVSS DKVKLVDDIL AFLAPIRHPE RLGKPNAQIT 50 0 

YTDDEIQVAK LAGKYTTEDG YIFDPRDITS DEGDAYVTPH MTHSHWIKKD 55 0 

SLSEAERAAA QAYAKEKGLT PPSTDHQDSG NTEAKGAEAI YNRVKAAKKV 60 0 

PLDRMPYNLQ YTVEVKNGSL IIPHYDHYHN IKFEWFDEGL YEAPKGYSLE 650 

DLLATVKYYV EHPNERPHSD NGFGNASDHV RKNKADQDSK PDEDKEHDEV 70 0 

SEPTHPESDE KENHAGLNPS ADNLYKPSTD TEETEEEAED TTDEAEIPQV 750 

ENSVINAKIA DAEALLEKVT DPSIRQNAME TLTGLKSSLL LGTKDNNTIS 800 

AEVDSLLALL KESQPAPIQ (SEQ ID NO : 73) 819 

FIGURE 3 8 

ENISSLLREL YAKPLSERHV ESDGLIFDPA QITSRTARGV AVPHGNHYHF 50 

IPYEQMSELE KRIARIIPLR YRSNHWVPDS RPEQPSPQST PEPSPSLQPA 100 

PNPQPAPSNP IDEKLVKEAV RKVGDGYVFE ENGVSRYIPA KDLSAETAAG 15 0 

IDSKLAKQES LSHKLGAKKT DLPSSDREFY NKAYDLLARI HQDLLDNKGR 200 

QVDFEVLDNL LERLKDVSSD KVKLVDDILA FLAPIRHPER LGKPNAQITY 250 

TDDEIQVAKL AGKYTTEDGY IFDPRDITSD EGDAYVTPHM THSHWIKKDS 300 

LSEAERAAAQ AYAKEKGLTP PSTDHQDSGN TEAKGAEAIY NRVKAAKKVP 350 

LDRMPYNLQY TVEVKNGSLI IPHYDHYHNI KFEWFDEGLY EAPKGYSLED 4 00 

LLATVKYYVE HPNERPHSDN GFGNASDHVR KNKADQDSKP DEDKEHDEVS 450 

EPTHPESDEK ENHAGLNPSA DNLYKPSTDT EETEEEAEDT TDEAEIPQVE 5 00 

NSVINAKIAD AEALLEKVTD PSIRQNAMET LTGLKSSLLL GTKDNNTISA 550 

EVDSLLALLK ESQPAPIQ (SEQ ID NO : 74) 568 

FIGURE 39 



VRKNKADQDS KPDEDKEHDE VSEPTHPESD EKENHAGLNP SADNLYKPST 
DTEETEEEAE DTTDEAEIPQ VENSVINAKI ADAEALLEKV TDPSIRQNAM 
ETLTGLKSSL LLGTKDNNTI SAEVDSLLAL LKESQPAPIQ 
(SEQ ID NO ; 75) 

FIGURE 4 0 



GACTTGACAG AAGAGCAAAT TAAGGCTGCG CAAAAACATT TAGAGGAAGT 
TAAAACTAGT CATAATGGAT TAGATTCTTT GTCATCTCAT GAACAGGATT 
ATCCAGGTAA TGCCAAAGAA ATGAAAGATT TAGATAAAAA AAT CGAAGAA 
AAAATTGCTG GCATTATGAA ACAATATGGT GTCAAACGTG AAAGTATTGT 
CGTGAATAAA GAAAAAAATG CGATTATTTA TCCGCATGGA GATCACCATC 
ATGCAGATCC GATTGATGAA CATAAACCGG TTGGAATTGG TCATTCTCAC 
AGTAAC TATG AACTGTTTAA ACCCGAAGAA GGAGTTGCTA AAAAAGAAGG 
GAATAAAGTT TATACTGGAG AAGAATTAAC GAATGTTGTT AATTTGTTAA 
AAAATAGTAC GTTTAATAAT CAAAACTTTA CTCTAGCCAA TGGTCAAAAA 
CGCGTTTCTT TTAGTTTTCC GCCTGAATTG GAGAAAAAAT TAGGTATCAA 
TATGCTAGTA AAATTAATAA CACCAGATGG AAAAGTATTG GAGAAAGTAT 
CTGGTAAAGT ATTTGGAGAA GGAGTAGGGA ATATTGCAAA CTTTGAATTA 
GATCAACCTT ATTTACCAGG ACAAACATTT AAGTATACTA TCGCTTCAAA 
AGATTATCCA GAAGTAAGTT ATGATGGTAC ATTTACAGTT CCAACCTCTT 
TAGCTTACAA AATGGCCAGT CAAACGATTT TCTATCCTTT CCATGCAGGG 
GATACTTATT TAAGAGTGAA CCCTCAATTT GCAGTGCCTA AAGGAACTGA 
TGCTTTAGTC AGAGTGTTTG ATGAATTTCA TGGAAATGCT TATTTAGAAA 
ATAACTATAA AGTTGGTGAA ATCAAATTAC CGATTCCGAA ATTAAACCAA 
GGAACAACCA GAACGGCCGG AAATAAAATT CCTGTAACCT TCATGGCAAA 
TGCTTATTTG G AC AAT C AAT CGACTTATAT TGTGGAAGTA CCTATCTTGG 
AAAAAGAAAA TCAAACTGAT AAACCAAGTA TTCTACCACA ATTTAAAAGG 
AATAAAGCAC AAGAAAACTC AAAACTTGAT GAAAAGGTAG AAG AAC C AAA 
GACTAGTGAG AAGGTAGAAA AAGAAAAACT TTCTGAAACT GGGAATAGTA 
CTAGTAATTC AACGTTAGAA GAAGTTCCTA CAGTGGATCC TGTACAAGAA 
AAAGTAGCAA AATTTGCTGA AAGTTATGGG ATGAAGCTAG AAAATGTCTT 
GTTTAATATG GACGGAACAA TTGAATTATA TTTACCATCA GGAGAAGTCA 
TTAAAAAGAA TATGGCAGAT TTTACAGGAG AAGCACCTCA AGGAAATGGT 
GAAAATAAAC CATCTGAAAA TGGAAAAGTA TCTACTGGAA CAGTTGAGAA 
CCAACCAACA GAAAATAAAC CAGCAGATTC TTTACCAGAG GCACCAAACG 
AAAAACCTGT AAAACCAGAA AACTCAACGG ATAATGGAAT GTTGAATCCA 
GAAGGGAATG TGGGGAGTGA CCCTATGTTA GATCCAGCAT TAGAGGAAGC 
TCCAGCAGTA GATCCTGTAC AAGAAAAATT AGAAAAATTT ACAGCTAGTT 
ACGGATTAGG CTTAGATAGT GTTATATTCA ATATGGATGG AACGATTGAA 
TTAAGATTGC CAAGTGGAGA AGTGATAAAA AAGAATTTAT CTGATTTCAT 
AGCGAAGCTT CGTTATCGTT CAAACCATTG GGTACCAGAT TCAAGACCAG 
AAG AAC C AAG TCCACAACCG ACTCCAGAAC CTAGTCCAAG TCCGCAACCT 
GCACCAAATC CTCAACCAGC TCCAAGCAAT CCAATTGATG AGAAATTGGT 
CAAAGAAGCT GTTCGAAAAG TAGGCGATGG TTATGTCTTT GAGGAGAATG 
GAGTTTCTCG TTATATCCCA GCCAAGAATC TTTCAGCAGA AACAGCAGCA 
GGCATTGATA GCAAACTGGC CAAGCAGGAA AGTTTATCTC ATAAGCTAGG 
AGCTAAGAAA ACTGACCTCC CATCTAGTGA TCGAGAATTT TACAATAAGG 
CTTATGACTT ACTAGCAAGA ATTCACCAAG ATTTACTTGA TAATAAAGGT 
CGACAAGTTG ATTTTGAGGC TTTGGATAAC CTGTTGGAAC GACTCAAGGA 
TGTCTCAAGT GATAAAGTCA AGTTAGTGGA TGATATTCTT GCCTTCTTAG 
CTCCGATTCG TCATCCAGAA CGTTTAGGAA AACCAAATGC GCAAATTACC 
TACACTGATG ATGAGATTCA AGTAGCCAAG TTGGCAGGCA AGTACACAAC 
AGAAGACGGT TATATCTTTG ATCCTCGTGA TATAACCAGT GATGAGGGGG 
ATGCCTATGT AACTCCACAT ATGACCCATA GCCACTGGAT TAAAAAAGAT 
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AGTTTGTCTG AAGCTGAGAG AGCGGCAGCC CAGGCTTATG CTAAAGAGAA 2450 

AGGTTTGACC CCTCCTTCGA CAGACCATCA GGATTCAGGA AATACTGAGG 2500 

CAAAAGGAGC AGAAGCTATC TACAACCGCG TGAAAGCAGC TAAGAAGGTG 255 0 

CCACTTGATC GTATGCCTTA CAATCTTCAA TATACTGTAG AAGTCAAAAA 2 60 0 

CGGTAGTTTA ATCATACCTC ATTATGACCA TTACCATAAC ATCAAATTTG 2650 

AGTGGTTTGA CGAAGGCCTT TATGAGGCAC CTAAGGGGTA TACTCTTGAG 2700 

GATCTTTTGG CGACTGTCAA GTACTATGTC GAACATCCAA ACGAACGTCC 2 750 

GCATTCAGAT AATGGTTTTG GTAACGCTAG CGACCATGTT CAAAGAAACA 2 80 0 

AAAATGGTCA AGCTGATACC AATCAAACGG AAAAACCAAG CGAGGAGAAA 2 850 

CCTCAGACAG AAAAACCTGA GGAAGAAACC CCTCGAGAAG AGAAACCACA 2 900 

AAGCGAGAAA CCAGAGTCTC CAAAACCAAC AGAGGAACCA GAAGAAGAAT 2 950 

CACCAGAGGA ATCAGAAGAA CCTCAGGTCG AGACTGAAAA GGTTGAAGAA 3 0 00 

AAACTGAGAG AGGCTGAAGA TTTACTTGGA AAAATCCAGG ATCCAATTAT 3 05 0 

CAAGTCCAAT GCCAAAGAGA CTCTCACAGG ATTAAAAAAT AATTTACTAT 3100 

TTGGCACCCA GGACAACAAT ACTATTATGG CAGAAGCTGA AAAACTATTG 3150 

GCTTTATTAA AGGAGAGTAA G (SEQ ID NO : 76) 3171 
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EAYWNGKQGS RPSSSSSYNA NPVQPRLSEN 
LRELYAKPLS ERHVESDGLI FDPAQITSRT 
SELEKRIARI IPLRYRSNHW VPDSRPEQPS 
PSNPIDEKLV KEAVRKVGDG YVFEENGVSR 
KQESLSHKLG AKKTDLPSSD REFYNKAYDL 
LDNLLERLKD VSSDKVKLVD DILAFLAPIR 
VAKLAGKYTT EDGYIFDPRD ITSDEGDAYV 
AAAQAYAKEK GLTPPSTDHQ DSGNTEAKGA 
NLQYTVEVKN GSLIIPHYDH YHNIKFEWFD 
YYVEHPNERP HSDNGFGNAS DHV (SEQ 



HNLTVTPTYH QNQGENISSL 50 

ARGVAVPHGN HYHFIPYEQM 100 

PQSTPEPSPS LQPAPNPQPA 15 0 

YIPAKDLSAE TAAGIDSKLA 2 00 

LARIHQDLLD NKGRQVDFEV 2 50 

HPERLGKPNA QITYTDDEIQ 3 00 

TPHMTHSHWI KKDSLSEAER 3 50 

EAIYNRVKAA KKVPLDRMPY 400 

EGLYEAPKGY SLEDLLATVK 450 

ID NO : 77) 473 
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CAYALNQHRS QENKDNNRVS YVDGSQSSQK SENLTPDQVS QKEGIQAEQI 5 0 

VIKITDQGYV TSHGDHYHYY NGKVPYDALF SEELLMKDPN YQLKDADIVN 10 0 

EVKGGYIIKV DGKYYVYLKD AAHADNVRTK DEINRQKQEH VKDNEKVNSN 150 

VAVARSQGRY TTNDGYVFNP ADIIEDTGNA YIVPHGGHYH YIPKSDLSAS 2 00 

ELAAAKAHLA GKNMQPSQLS YSSTASDNNT QSVAKGSTSK PANKSENLQS 2 50 

LLKELYDSPS AQRYSESDGL VFDPAKIISR TPNGVAIPHG DHYHFIPYSK 3 00 

LSALEEKIAR MVPISGTGST VSTNAKPNEV VSSLGSLSSN PSSLTTSKEL 350 

SSASDGYIFN PKDIVEETAT AYIVRHGDHF HYIPKSNQIG QPTLPNNSLA 400 

TPSPSLPINP GTSHEKHEED GYGFDANRI I AEDESGFVMS HGDHNHYFFK 450 

KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS HEQDYPGNAK EMKDLDKKIE 5 00 

EKIAGIMKQY GVKRESIWN KEKNAIIYPH GDHHHADPID EHKPVGIGHS 550 

HSNYELFKPE EGVAKKEGNK VYTGEELTNV VNLLKNSTFN NQNFTLANGQ 600 

KRVSFSFPPE LEKKLGINML VKLITPDGKV LEKVSGKVFG EGVGNIANFE 650 

LDQPYLPGQT FKYTIASKDY PEVSYDGTFT VPTSLAYKMA SQTIFYPFHA 700 

GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN AYLENNYKVG EIKLPIPKLN 75 0 

QGTTRTAGNK IPVTFMANAY LDNQSTYIVE (SEQ ID NO : 78) 780 

FIGURE 4 3 

CAYELGLHQA QTVKENNRVS YIDGKQATQK TENLTPDEVS KREGINAEQI 50 

VIKITDQGYV TSHGDHYHYY NGKVPYDAII SEELLMKDPN YQLKDSDIVN 100 

EIKGGYVIKV NGKYYVYLKD AAHADNVRTK EEINRQKQEH SQHREGGTSA 150 

NDGAVAFARS QGRYTTDDGY IFNASDIIED TGDAYIVPHG DHYHYIPKNE 2 00 

LSASELAAAE AFLSGRENLS NLRTYRRQNS DNTPRTNWVP SVSNPGTTNT 250 

NTSNNSNTNS QASQSNDIDS LLKQLYKLPL SQRHVESDGL IFDPAQITSR 3 00 

TARGVAVPHG NHYHFIPYEQ MSELEKRIAR IIPLRYRSNH WVPDSRPEEP 350 

SPQPTPEPSP SPQPAPNPQP APSNPIDEKL VKEAVRKVGD GYVFEENGVS 4 00 

RYIPAKNLSA ETAAGIDSKL AKQESLSHKL GAKKTDLPSS DREFYNKAYD 450 

LLARIHQDLL DNKGRQVDFE ALDNLLERLK DVSSDKVKLV DDILAFLAPI 5 00 

RHPERLGKPN AQITYTDDEI QVAKLAGKYT TEDGYIFDPR DITSDEGDAY 550 

VTPHMTHSHW IKKDSLSEAE RAAAQAYAKE KGLTPPSTDH QDSGNTEAKG 600 

AEAIYNRVKA AKKVPLDRMP YNLQYTVEVK NGSLIIPHYD HYHNIKFEWF 65 0 

DEGLYEAPKG YTLEDLLATV KYYVEHPNER PHSDNGFGNA 690 
(SEQ ID NO : 79) 
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GTGAAGAAAA CATATGGTTA TATCGGCTCA GTTGCTGCCA TTTTACTAGC TACTCATATT 60 

GGAAGTTACC AACTTGGTAA GCATCATATG GGTCTAGCAA CAAAGGACAA TCAGATTGCC 12 0 

TATATTGATG ACAGCAAAGG TAAGGCAAAA GCCCCTAAAA CAAACAAAAC GATGGATCAA 180 

ATCAGTGCTG AAGAAGGCAT CTCTGCTGAA CAGATCGTAG TCAAAATTAC TGACCAAGGC 24 0 

TATGTGACCT CACACGGTGA CCATTATCAT TTTTACAATG GGAAAGTTCC TTATGATGCG 3 00 

ATTATTAGTG AAGAGTTGTT GATGACGGAT CCTAATTACC GTTTTAAACA ATCAGACGTT 3 60 

ATCAATGAAA TCTTAGACGG TTACGTTATT AAAGTCAATG GCAACTATTA TGTTTACCTC 42 0 

AAGCCAGGTA GTAAGCGCAA AAACATTCGA ACCAAACAAC AAATTGCTGA GCAAGTAGCC 480 

AAAGGAACTA AAGAAGCTAA AGAAAAAGGT TTAGCTCAAG TGGCCCATCT CAGTAAAGAA 54 0 

GAAGTTGCGG CAGTCAATGA AGCAAAAAGA CAAGGACGCT ATACTACAGA CGATGGCTAT 600 

ATTTTTAGTC CGACAGATAT CATTGATGAT TTAGGAGATG CTTATTTAGT ACCTCATGGT 660 

AATCACTATC ATTATATTCC TAAAAAGGAT TTGTCTCCAA GTGAGCTAGC TGCTGCACAA 72 0 

GCCTACTGGA GTCAAAAACA AGGTCGAGGT GCTAGACCGT CTGATTACCG CCCGACACCA 780 

GCCCCAGGTC GTAGGAAAGC CCCAATTCCT GATGTGACGC CTAACCCTGG ACAAGGTCAT 84 0 

CAGCCAGATA ACGGTGGCTA TCATCCAGCG CCTCCTAGGC CAAATGATGC GTCACAAAAC 90 0 

AAACACCAAA GAGATGAGTT TAAAGGAAAA ACCTTTAAGG AACTTTTAGA TCAACTACAC 960 

CGTCTTGATT TGAAATACCG TCATGTGGAA GAAGATGGGT TGATTTTTGA ACCGACTCAA 102 0 

GTGATCAAAT CAAACGCTTT TGGGTATGTG GTGCCTCATG GAGATCATTA TCATATTATC 108 0 

CCAAGAAGTC AGTTATCACC TCTTGAAATG GAATTAGCAG ATCGATACTT AGCTGGCCAA 1140 

ACTGAGGACA ATGACTCAGG TTCAGAGCAC TCAAAACCAT CAGATAAAGA AGTGACACAT 12 00 

ACCTTTCTTG GTCATCGCAT CAAAGCTTAC GGAAAAGGCT TAGATGGTAA ACCATATGAT 12 60 

ACGAGTGATG CTTATGTTTT TAGTAAAGAA TCCATTCATT CAGTGGATAA ATCAGGAGTT 132 0 

ACAGCTAAAC ACGGAGATCA TTTCCACTAT ATAGGATTTG GAGAACTTGA ACAATATGAG 13 8 0 

TTGGATGAGG TCGCTAACTG GGTGAAAGCA AAAGGTCAAG CTGATGAGCT TGCTGCTGCT 144 0 

TTGGATCAGG AACAAGGCAA AGAAAAACCA CTCTTTGACA CTAAAAAAGT GAGTCGCAAA 15 00 

GTAACAAAAG ATGGTAAAGT GGGCTATATG ATGCCAAAAG ATGGTAAGGA CTATTTCTAT 1560 

GCTCGTGATC AACTTGATTT GACTCAGATT GCCTTTGCCG AACAAGAACT AATGCTTAAA 162 0 

GATAAGAAGC ATTACCGTTA TGACATTGTT GACACAGGTA TTGAGCCACG ACTTGCTGTA 1680 

GATGTGTCAA GTCTGCCGAT GCATGCTGGT AATGCTACTT ACGATACTGG AAGTTCGTTT 1740 

GTTATCCCAC ATATTGATCA TATCCATGTC GTTCCGTATT CATGGTTGAC GCGCGATCAG 180 0 

ATTGCAACAG TCAAGTATGT GATGCAACAC CCCGAAGTTC GTCCGGATGT ATGGTCTAAG 186 0 

CCAGGGCATG AAGAGTCAGG TTCGGTCATT CCAAATGTTA CGCCTCTTGA TAAACGTGCT 192 0 

GGTATGCCAA ACTGGCAAAT TATCCATTCT GCTGAAGAAG TTCAAAAAGC CCTAGCAGAA 1980 

GGTCGTTTTG CAACACCAGA CGGCTATATT TTCGATCCAC GAGATGTTTT GGCCAAAGAA 2 040 

ACTTTTGTAT GGAAAGATGG CTCCTTTAGC ATCCCAAGAG CAGATGGCAG TTCATTGAGA 2100 

ACCATTAATA AATCTGATCT ATCCCAAGCT GAGTGGCAAC AAGCTCAAGA GTTATTGGCA 2160 

AAGAAAAATA CTGGTGATGC TACTGATACG GATAAACCCA AAGAAAAGCA ACAGGCAGAT 222 0 

AAGAGCAATG AAAACCAACA GCCAAGTGAA GCCAGTAAAG AAGAAAAAGA ATCAGATGAC 22 8 0 

TTTATAGACA GTTTACCAGA CTATGGTCTA GATAGAGCAA CCCTAGAAGA TCATATCAAT 234 0 

CAATTAGCAC AAAAAGCTAA TATCGATCCT AAGTATCTCA TTTTCCAACC AGAAGGTGTC 24 00 

CAATTTTATA ATAAAAATGG TGAATTGGTA ACTTATGATA TCAAGACACT TCAACAAATA 24 60 

AACCCTTAA (SEQ ID NO : 80) 2469 
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VKKTYGYIGS VAAILLATHI 
APKTNKTMDQ ISAEEGISAE 
IISEELLMTD PNYRFKQSDV 
TKQQIAEQVA KGTKEAKEKG 
IFSPTDIIDD LGDAYLVPHG 
ARPSDYRPTP APGRRKAPIP 
KHQRDEFKGK TFKELLDQLH 
VPHGDHYHII PRSQLSPLEM 
TFLGHRIKAY GKGLDGKPYD 
IGFGELEQYE LDEVANWVKA 
VTKDGKVGYM MPKDGKDYFY 
DTGIEPRLAV DVSSLPMHAG 
IATVKYVMQH PEVRPDVWSK 
AEEVQKALAE GRFATPDGYI 
TINKSDLSQA EWQQAQELLA 
ASKEEKESDD FIDSLPDYGL 
QFYNKNGELV TYDIKTLQQI 



GSYQLGKHHM GLATKDNQIA 
QIWKITDQG YVTSHGDHYH 
INEILDGYVI KVNGNYYVYL 
LAQVAHLSKE EVAAVNEAKR 
NHYHYIPKKD LSPSELAAAQ 
DVTPNPGQGH QPDNGGYHPA 
RLDLKYRHVE EDGLIFEPTQ 
ELADRYLAGQ TEDNDSGSEH 
TSDAYVFSKE SIHSVDKSGV 
KGQADELAAA LDQEQGKEKP 
ARDQLDLTQI AFAEQELMLK 
NATYDTGSSF VIPHIDHIHV 
PGHEESGSVI PNVTPLDKRA 
FDPRDVLAKE TFVWKDGSFS 
KKNTGDATDT DKPKEKQQAD 
DRATLEDHIN QLAQKANTDP 
NPP (SEQ ID NO : 81) 



YIDDSKGKAK 


50 


FYNGKVPYDA 


100 


KPGSKRKNTR 


150 


QGRYTTDDGY 


200 


AYWSQKQGRG 


250 


PPRPNDASQN 


300 


VIKSNAFGYV 


350 


SKPSDKEVTH 


400 


TAKHGDHFHY 


450 


LFDTKKVSRK 


500 


DKKHYRYDIV 


550 


VPYSWLTRDQ 


600 


GMPNWQIIHS 


650 


IPRADGSSLR 


700 


KSNENQQPSE 


750 


KYLIFQPEGV 


800 




823 
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GTGAAGAAAA CATATGGTTA TATCGGCTCA GTTGCTGCCA TTTTACTAGC TACTCATATT 60 

GGAAGTTACC AACTTGGTAA GCATCATATG GGTCTAGCAA CAAAGGACAA TCAGATTGCC 12 0 

TATATTGATG ATAGCAAAGG TAAGGCAAAA GCCCCTAAAA CAAACAAAAC GATGGATCAA 180 

ATCAGTGCTG AAGAAGGCAT CTCTGCTGAA CAGATCGTAG TCAAAATTAC TGACCAAGGT 24 0 

TATGTGACCT CACACGGTGA CCATTATCAT TTTTACAATG GGAAAGTTCC TTATGATGCG 3 00 

ATTATTAGTG AAGAGTTGTT GATGACGGAT CCTAATTACC ATTTTAAACA ATCAGACGTT 3 60 

ATCAATGAAA TCTTAGACGG TTACGTTATT AAAGTCAATG GCAACTATTA TGTTTACCTC 42 0 

AAGC CAGGTA GTAAGCGCAA AAACATTCGA ACCAAACAAC AAATTGCTGA GCAAGTAGCC 4 80 

AAAGGAACTA AAGAAGCTAA AGAAAAAGGT TTAGCTCAAG TGGCCCATCT CAGTAAAGAA 54 0 

GAAGTTGCGG CAGTCAATGA AGCAAAAAGA CAAGGACGCT ATACTACAGA CGATGGCTAT 60 0 

ATTTTTAGTC CGACAGATAT CATTGATGAT TTAGGAGACG CTTATTTAGT ACCTCATGGT 660 

AATCACTATC ATTATATTCC TAAAAAAGAT TTGTCTCCAA GTGAGCTAGC TGCTGCACAA 72 0 

GCTTACTGGA GTCAAAAACA AGGTCGAGGT GCTAGACCGT CTGATTACCG CCCGACACCA 780 

GCCCCAGGTC GTAGGAAAGC TCCAATTCCT GATGTGACGC CTAACCCTGG ACAAGGTCAT 84 0 

CAGCCAGATA ACGGTGGCTA TCATCCAGCG CCTCCTAGGC CAAATGATGC GTCACAAAAC 900 

AAAC AC C AAA GAGATGAGTT TAAAGGAAAA ACCTTTAAGG AACTTTTAGA TCAACTACAC 960 

CGTCTTGATT TGAAATACCG TCATGTGGAA GAAGATGGGT TGATTTTTGA ACCGACTCAA 102 0 

GTGATCAAAT CAAACGCTTT TGGGTATGTG GTGCCTCATG GAGATCATTA TCATATTATC 1080 

CCAAGAAGTC AGTTATCACC TCTTGAAATG GAATTAGCAG ATCGATACTT AGCCGGTCAA 114 0 

ACTGAGGACA ATGATTCAGG TTCAGATCAC TCAAAACCAT CAGATAAAGA AGTGACACAT 12 00 

ACCTTTCTTG GTCATCGCAT CAAAGCTTAC GGAAAAGGCT TAGATGGTAA ACCATATGAT 12 60 

ACGAGTGATG CTTATGTTTT TAGTAAAGAA TCCATTCATT CAGTGGATAA AT C AGGAGTT 132 0 

ACAGCTAAAC ACGGAGATCA TTTCCACTAT ATAGGATTTG GAGAACTTGA ACAATATGAG 13 8 0 

TTGGATGAGG TCGCTAACTG GGTGAAAGCA AAAGGTCAAG CTGATGAGCT TGCTGCTGCT 144 0 

TTGGATCAGG AACAAGGCAA AGAAAAACCA CTCTTTGACA CTAAAAAAGT GAGTCGCAAA 150 0 

GTAACAAAAG ATGGTAAAGT GGGCTATATT ATGCCAAAAG ATGGCAAGGA CTATTTCTAT 1560 

GCTCGTGATC AACTTGATTT GACTCAGATT GCCTTTGCCG AACAAGAACT AATGCTTAAA 162 0 

GATAAGAACC ATTACCGTTA TGACATTGTT GACACAGGTA TTGAGCCACG ACTTGCTGTA 168 0 

GATGTGTCAA GTCTGCCGAT GCATGCTGGT AATGCTACTT ACGATACTGG AAGTTCGTTT 174 0 

GTTATCCCTC ATATTGATCA TATCCATGTC GTTCCGTATT CATGGTTGAC GCGCGATCAG 180 0 

ATTGCAACAA TCAAGTATGT GATGCAACAC CCCGAAGTTC GTCCAGATGT ATGGTCTAAG 1860 

CCAGGGCATG AAGAGTCAGG TTCGGTCATT CCAAATGTTA CGCCTCTTGA TAAACGTGCT 192 0 

GGTATGCCAA ATTGGCAAAT CATCCATTCT GCTGAAGAAG TTCAAAAAGC CCTAGCAGAA 1980 

GGTCGTTTTG CAACACCAGA CGGCTATATT TTCGATCCAC GAGATGTTTT GGCCAAAGAA 2 04 0 

ACTTTTGTAT GGAAAGATGG CTCCTTTAGC ATCCCAAGAG CAGATGGCAG TTCATTGAGA 2100 

ACCATTAATA AATCTGATCT ATCCCAAGCT GAGTGGCAAC AAGCTCAAGA GTTATTGGCA 2160 

AAGAAAAACG CTGGTGATGC TACTGATACG GATAAACCCA AAGAAAAGCA ACAGGCAGAT 222 0 

AAGAGCAATG AAAACCAACA GCCAAGTGAA GCCAGTAAAG AAGAAGAAAA AGAATCAGAT 22 80 

GACTTTATAG ACAGTTTACC AGACTATGGT CTAGATAGAG CAACC CTAGA AGAT CAT AT C 2 340 

AATCAATTAG CACAAAAAGC TAATATCGAT CCTAAGTATC TCATTTTCCA ACCAGAAGGT 24 00 

GTCCAATTTT ATAATAAAAA TGGTGAATTA GTAACTTATG ATATCAAGAC GCTTCAACAA 2460 

ATAAACCCTT AA (SEQ ID NO : 82) 2472 

FIGURE 47 



VKKTYGYIGS VAAILLATHI 
APKTNKTMDQ ISAEEGISAE 
IISEELLMTD PNYHFKQSDV 
TKQQIAEQVA KGTKEAKEKG 
IFSPTDIIDD LGDAYLVPHG 
ARPSDYRPTP APGRRKAPIP 
KHQRDEFKGK TFKELLDQLH 
VPHGDHYHII PRSQLSPLEM 
TFLGHRIKAY GKGLDGKPYD 
IGFGELEQYE LDEVAMWVKA 
VTKDGKVGYI MPKDGKDYFY 
DTGIEPRLAV DVSSLPMHAG 
IATIKYVMQH PEVRPDVWSK 
AEEVQKALAE GRFATPDGYI 
TINKSDLSQA EWQQAQELLA 
ASKEEEKESD DFIDSLPDYG 
VQFYNKNGEL VTYDIKTLQQ 



GSYQLGKHHM GLATKDNQIA 
QIWKITDQG YVTSHGDHYH 
INEILDGYVI KVNGNYYVYL 
LAQVAHLSKE EVAAVNEAKR 
NHYHYIPKKD LSPSELAAAQ 
DVTPNPGQGH QPDNGGYHPA 
RLDLKYRHVE EDGLIFEPTQ 
ELADRYLAGQ TEDNDSGSDH 
TSDAYVFSKE SIHSVDKSGV 
KGQADELAAA LDQEQGKEKP 
ARDQLDLTQI AFAEQELMLK 
NATYDTGSSF VIPHIDHIHV 
PGHEESGSVI PNVTPLDKRA 
FDPRDVLAKE TFVWKDGSFS 
KKNAGDATDT DKPKEKQQAD 
LDRATLEDHI NQLAQKANID 
INPP (SEQ ID NO : 82 





50 


FYNGKVPYDA 


100 


KPGSKRKNIR 


150 


QGRYTTDDGY 


200 


AYWSQKQGRG 


250 


PPRPNDASQN 


300 


VIKSNAFGYV 


350 


SKPSDKEVTH 


400 


TAKHGDHFHY 


450 


LFDTKKVSRK 


500 


DKNHYRYD IV 


550 


VPYSWLTRDQ 


600 


GMPNWQIIHS 


650 


IPRADGSSLR 


700 


KSNENQQPSE 


750 


PKYLIFQPEG 


800 


) 


824 
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